(54) Rice promoters

(57) The present invention provides several promoters isolated from Oryza sativa, which promoters are
capable of driving and/or regulating the expression of an operably linked nucleic acid in a plant. The
expression patterns of the promoters according to the present invention have been studied in Oryza sativa
and some of the promoters displayed specific activity in particular cells, tissues or organs of the plant,
while others displayed constitutive expression throughout substantially the whole plant. Some promoters
showed weak expression, while others were strongly active.
Description

[0001] The present invention relates to the field of plant molecular biology, more particularly to nucleic acid sequences 
useful for driving and/or regulating expression of an operably linked nucleic acid in plants. The isolation of these nucleic
acid sequences from rice, as well as their use in driving and/or regulating expression of an operably linked nucleic acid
is disclosed. The present invention therefore concerns promoters, hybrid promoters, genetic constructs, expression 
cassettes, transformation vectors, expression vectors, host cells and transgenic plants comprising the isolated nucleic 
acids according to the present invention. The present invention also concerns methods for driving and/or regulating 
expression of a nucleic acid and methods for the production of transgenic plants. 

[0002] Gene expression is dependent on initiation of transcription, which is mediated via the transcription initiation
complex. Gene expression is also dependent on regulation of transcription, which regulation determines how strongly,
when or where a gene is expressed. Said regulation of gene expression may be mediated via transcriptional control
elements, which are generally embedded in the nucleic acid sequence 5'-flanking or upstream of the expressed gene.
This upstream nucleic acid region is often referred to as a "promoter" since it promotes the binding, formation and/or
activation of the transcription initiation complex and therefore is capable of driving and/or regulating expression of the
3' downstream nucleic acid sequence.

[0003] Genetic engineering of plants with the aim of obtaining a useful plant phenotype often involves heterologous
gene expression, which is generally mediated by a promoter capable of driving and/or regulating expression of an
operably linked heterologous nucleic acid. The phenotype of the host plant depends not only on the contribution of the
heterologous nucleic acid, but also on the contribution of the specific expression pattern of the chosen promoter
determining how, where and when that heterologous nucleic acid is expressed. Accordingly, the choice of promoter with a
suitable expression pattern is of crucial importance for obtaining the suitable phenotype. A person skilled in the art will
need to have different promoters available to determine the optimal promoter for a particular nucleic acid. For many
different host plants, this availability is rather limited and there is therefore a continuing need to provide new promoters
with various expression profiles.

[0004] The nucleic acids as presented in SEQ ID NO 1 to 22 were isolated from Oryza sativa and have been found
to be capable of driving and regulating expression of an operably linked nucleic acid; their expression patterns have 
also been characterized. Therefore the present invention offers a collection of hitherto unknown isolated nucleic acids, 
which isolated nucleic acids are useful as promoters. 

[0005] Accordingly, the present invention provides an isolated promoter capable of driving and/or regulating expression,
comprising:

(a) an isolated nucleic acid as given in any one of SEQ ID NO 1 to 22 or the complement of any one of SEQ ID NO
1 to 22; or

(b) an isolated nucleic acid having at least 90% sequence identity with any of the DNA sequences as given in any
one of SEQ ID NO 1 to 22; or

(c) an isolated nucleic acid specifically hybridizing under stringent conditions with any of the DNA sequences as
given in any one of SEQ ID NO 1 to 22; or

(d) an isolated nucleic acid as defined in any one of (a) to (c), which is interrupted by an intervening sequence; or

(e) a fragment of any of the nucleic acids as defined in (a) to (d), which fragment is capable of driving and/or regulating
expression.

[0006] The term "isolated" as used herein means being removed from its original source. Preferably, the "isolated"
promoter is free of sequences (such as protein encoding sequences or other sequences at the 3' end) that naturally
flank the promoter in the genomic DNA of the organism from which the promoter is derived. Further preferably, the
"isolated" promoter is also free of sequences that naturally flank it at the 5' end. Further preferably, the "isolated" promoter
may comprise less than about 5 kb, 4 kb, 3 kb, 2 kb, 1.5 kb, 1.2 kb, 1 kb, 0.8 kb, 0.5 kb or 0.1 kb of nucleotide sequences
that naturally occur with the promoter in genomic DNA from the organism from which the promoter is derived.
[0007] The present invention is not limited to the nucleic acids as presented by SEQ ID NO 1 to 22. A person skilled
in the art will recognize that variants or fragments of a nucleic acid may occur, whilst maintaining the same functionality.
These variants or fragments may be man-made (e.g. by genetic engineering) or may even occur in nature. Therefore
the present invention extends to variant nucleic acids and fragments of any of SEQ ID NO 1 to 22, which variants or
fragments are useful in the methods of the present invention. Such variants and fragments include:

(a) an isolated nucleic acid as given in any one of SEQ ID NO 1 to 22 or the complement of any one of SEQ ID NO
1 to 22; or

(b) an isolated nucleic acid having at least 90% sequence identity with any of the DNA sequences as given in any
one of SEQ ID NO 1 to 22; or

(c) an isolated nucleic acid specifically hybridizing under stringent conditions with any of the DNA sequences as
given in any one of SEQ ID NO 1 to 22; or

(d) an isolated nucleic acid as defined in any one of (a) to (c), which is interrupted by an intervening sequence; or

(e) a fragment of any of the nucleic acids as defined in (a) to (d), which fragment is capable of driving and/or regulating
expression.

[0008] Suitable variants of any one of SEQ ID NO 1 to 22 encompass homologues which have in increasing order of 
preference at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with any one of the 
nucleic acids as represented in SEQ ID NO 1 to 22. 

[0009] The percentage of identity may be calculated using an alignment program. Preferably a pairwise global
alignment program may be used, which implements the algorithm of Needleman-Wunsch (J. Mol. Biol. 48: 443-453, 1970).
This algorithm maximizes the number of matches and minimizes the number of gaps. Such programs are for example
GAP, Needle (EMBOSS package), stretcher (EMBOSS package) or Align X (Vector NTI suite 5.5) and may use the
standard parameters (for example gap opening penalty 15 and gap extension penalty 6.66). Alternatively, a local
alignment program implementing the algorithm of Smith-Waterman (Advances in Applied Mathematics 2, 482-489 (1981))
may be used. Such programs are for example Water (EMBOSS package) or matcher (EMBOSS package). "Sequence
identity" as used herein is preferably calculated over the entire length of the promoters as represented by any one of
SEQ ID NO 1 to 22. The length of these promoters is presented in Table 2.
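
By way of illustration only, the calculation of sequence identity over the entire length of a reference promoter may be sketched as follows. The sketch is a simplified Needleman-Wunsch global alignment. The function names, the scoring values (match 1, mismatch -1) and the single linear gap penalty of -2 are assumptions chosen for brevity; they are not the gap opening and gap extension parameters of GAP, Needle or Align X, which use affine gap penalties.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align a and b; return the two gapped strings.

    Simplified sketch: a single linear gap penalty is assumed instead of
    the separate gap opening / gap extension penalties of real programs.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    i, j = n, m
    out_a, out_b = [], []
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(reference, candidate):
    """Identical aligned positions as a percentage of the reference length."""
    aligned_ref, aligned_cand = needleman_wunsch(reference, candidate)
    matches = sum(1 for x, y in zip(aligned_ref, aligned_cand)
                  if x == y and x != "-")
    return 100.0 * matches / len(reference)
```

Under these assumptions, a candidate would fall within the "at least 90% sequence identity" criterion when `percent_identity(reference, candidate) >= 90.0`, the reference being one of the promoter sequences. The quadratic memory use of this sketch is acceptable for sequences of promoter length but a dedicated alignment program remains preferable in practice.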

[0010] Search and identification of homologous nucleic acids would be well within the realm of a person skilled in the
art. Such methods involve screening sequence databases with the sequences provided by the present invention, for
example any one of SEQ ID NO 1 to 22, preferably in a computer readable form. Useful sequence databases include,
but are not limited to, Genbank (http://www.ncbi.nlm.nih.gov/web/Genbank), the European Molecular Biology Laboratory
Nucleic acid Database (EMBL) (http://www.ebi.ac.uk/ebi-docs/embl-db.html) or versions thereof, or the MIPS database
(http://mips.gsf.de/). Different search algorithms and software for the alignment and comparison of sequences are well known
in the art. Such software includes, for example, GAP, BESTFIT, BLAST, FASTA and TFASTA. Preferably BLAST software
is used, which calculates percent sequence identity and performs a statistical analysis of the similarity between the
sequences. The suite of programs referred to as BLAST programs has 5 different implementations: three designed for
nucleotide sequence queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence queries
(BLASTP and TBLASTN) (Coulson, Trends in Biotechnology: 76-80, 1994; Birren et al., Genome Analysis, 1: 543, 1997).
The software for performing BLAST analysis is publicly available through the National Centre for Biotechnology
Information.

[0011] The sequences of the genome of Arabidopsis thaliana and the genome of Oryza sativa are now available in
public databases such as Genbank. Other genomes are currently being sequenced. Therefore, it is expected that as
more sequences of the genomes of other plants become available, homologous promoters may be identifiable by
sequence alignment with any one of SEQ ID NO 1 to SEQ ID NO 22. The skilled person will readily be able to find
homologous promoters from other plant species, for example from other crop plants, such as maize. Homologous
promoters from other crop plants are especially useful for practising the methods of the present invention in crop plants.
[0012] One example of homologues having at least 90% sequence identity with any one of SEQ ID NO 1 to 22 are allelic
variants of any one of SEQ ID NO 1 to 22. Allelic variants are variants of the same gene occurring in two different
individuals of the same species and usually allelic variants differ by slight sequence changes. Allelic variants may
encompass Single Nucleotide Polymorphisms (SNPs) as well as Small Insertion/Deletion Polymorphisms (INDELs).
The size of INDELs is usually less than 100 bp. SNPs and INDELs form the largest set of sequence variants in naturally
occurring polymorphic strains of most organisms.

[0013] Homologues suitable for use in the methods according to the invention may readily be isolated from their source 
organism via the technique of PCR or hybridization. Their capability of driving and/or regulating expression may readily
be determined, for example, by following the methods described in the Examples section by simply substituting the 
sequence used in the actual Example with the homologue. 

[0014] Other suitable variants of any one of SEQ ID NO 1 to 22 encompassed by the present invention are nucleic
acids specifically hybridising under stringent conditions to any one of the nucleic acids of SEQ ID NO 1 to 22. The term
"hybridising" means annealing to substantially homologous complementary nucleotide sequences in a hybridisation
process. Tools in molecular biology relying on such a hybridisation process include the polymerase chain reaction (PCR;
and all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 mapping, primer
extension, reverse transcription, cDNA synthesis, differential display of RNAs, DNA sequence determination,
Northern blotting (RNA blotting) and Southern blotting (DNA blotting). The hybridisation process can also occur with one of the
complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose beads or any other resin.
Tools in molecular biology relying on such a process include the isolation of poly (A+) mRNA. The hybridisation process
can furthermore occur with one of the complementary nucleic acids immobilised to a solid support such as a
nitrocellulose or nylon membrane or immobilised by e.g. photolithography to, for example, a siliceous glass support (the
latter known as nucleic acid arrays or microarrays or as nucleic acid chips). Tools in molecular biology relying on such
a process include RNA and DNA gel blot analysis, colony hybridisation, plaque hybridisation, in situ hybridisation and
microarray hybridisation. In order to allow hybridisation to occur, the nucleic acid molecules are generally thermally or
chemically denatured to melt a double strand into two single strands and/or to remove hairpins or other secondary
structures from single stranded nucleic acids. The stringency of hybridisation is influenced by conditions such as
temperature, salt concentration and hybridisation buffer composition. Conventional hybridisation conditions are described
in, for example, Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition, Cold Spring Harbor Laboratory
Press, CSH, New York, but the skilled craftsman will appreciate that numerous different hybridisation conditions can be
designed as a function of the known or the expected homology and/or length of the nucleic acid sequence. High stringency
conditions for hybridisation include high temperature and/or low sodium/salt concentration (salts include sodium as for
example in NaCl and Na3-citrate) and/or the inclusion of formamide in the hybridisation buffer and/or lowering the
concentration of compounds such as SDS (sodium dodecyl sulphate detergent) in the hybridisation buffer and/or exclusion
of compounds such as dextran sulphate or polyethylene glycol (promoting molecular crowding) from the hybridisation
buffer. Specifically hybridising under stringent conditions means that the sequences have to be very similar. Specific
hybridisation under stringent conditions is preferably carried out at a temperature of 60°C followed by washes in 0.1 to
1X SSC, 0.1X SDS, and 1X SSC, 0.1X SDS.

[0015] The invention also relates to a nucleic acid molecule of at least 15 nucleotides in length hybridizing specifically
with any of the nucleic acids of the invention. The invention also relates to a nucleic acid molecule of at least 15 nucleotides
in length specifically amplifying a nucleic acid of the invention by polymerase chain reaction.

[0016] Other variants of any of SEQ ID NO 1 to 22 encompassed by the present invention are nucleic acids
corresponding to any one of SEQ ID NO 1 to 22 or variants thereof as described hereinabove, which are interrupted by an
intervening sequence. For example, any of the nucleic acids as presented in SEQ ID NO 1 to 22 may be interrupted by
an intervening sequence. By "intervening sequence" is meant any nucleic acid or nucleotide which disrupts another
sequence. Examples of intervening sequences comprise introns, nucleic acid tags, T-DNA and mobilizable nucleic acid
sequences such as transposons or nucleic acids that can be mobilized via recombination. Examples of particular
transposons comprise Ac (Activator), Ds (Dissociation), Spm (Suppressor-Mutator) or En. The introduction of introns into
promoters is now widely applied. The methods according to the present invention may also be practised using a nucleic
acid sequence according to any one of SEQ ID NO 1 to 22 provided with an intron. In case the intervening sequence is
an intron, alternative splice variants of the nucleic acids according to the invention may arise. The term "alternative splice
variant" as used herein encompasses variants of a nucleic acid sequence in which intervening introns have been excised,
replaced or added. Such splice variants may be found in nature or may be man-made. Methods for making such promoters
with an intron or for making the corresponding splice variants are well known in the art.

[0017] Variants interrupted by an intervening sequence, suitable for use in the methods according to the invention,
may readily be determined for example by following the methods described in the Examples section by simply substituting
the sequence used in the actual Example with the variant.

[0018] The variant nucleic acids as described hereinabove may be found in nature (for example allelic variants or 
splice variants). Additionally and/or alternatively, variants of any one of SEQ ID NO 1 to 22 as described hereinabove 
may be manmade via techniques well known in the art involving for example mutation, substitution, insertion, deletions 
or derivation. The present invention also encompasses such variants, as well as their use in the methods of the present
invention.

[0019] A "mutation variant" of a nucleic acid may readily be made using recombinant DNA manipulation techniques
or nucleotide synthesis. Examples of such techniques include site directed mutagenesis via M13 mutagenesis, T7-Gen
in vitro mutagenesis (USB, Cleveland, OH), QuikChange Site Directed mutagenesis (Stratagene, San Diego, CA),
PCR-mediated site-directed mutagenesis or other site-directed mutagenesis protocols. Alternatively, the nucleic acid of
the present invention may be randomly mutated.

[0020] A "substitutional variant" refers to those variants in which at least one residue in the nucleic acid sequence has 
been removed and a different residue inserted in its place. Nucleic acid substitutions are typically of single residues, but 
may be clustered depending upon functional constraints placed upon the nucleic acid sequence; insertions usually are 
of the order of about 1 to about 10 nucleic acid residues, and deletions can range from about 1 to about 20 residues. 

[0021] An "insertional variant" of a nucleic acid is a variant in which one or more nucleic acid residues are introduced
into a predetermined site in that nucleic acid. Insertions may comprise 5'-terminal and/or 3'-terminal fusions as well as
intra-sequence insertions of single or multiple nucleotides. Generally, insertions within the nucleic acid sequence will
be smaller than 5'- or 3'-terminal fusions, of the order of about 1 to 10 residues. Examples of 5'- or 3'-terminal fusions
include the coding sequences of binding domains or activation domains of a transcriptional activator as used in the yeast
two-hybrid system or yeast one-hybrid system, or of phage coat proteins, (histidine)6-tag, glutathione S-transferase-tag,
protein A, maltose-binding protein, dihydrofolate reductase, Tag•100 epitope, c-myc epitope, FLAG®-epitope, lacZ, CMP
(calmodulin-binding peptide), HA epitope, protein C epitope and VSV epitope.

EP 1 801 223 A2

[0022] The term "derivative" of a nucleic acid may comprise substitutions, and/or deletions and/or additions of naturally
and non-naturally occurring nucleic acid residues compared to the natural nucleic acid. Derivatives may, for example,
comprise methylated nucleotides, or artificial nucleotides.

[0023] Also encompassed within the present invention are promoters comprising a fragment of any of the nucleic
acids as presented by any one of SEQ ID NO 1 to 22 or variants thereof as described hereinabove. A "fragment" as
used herein means a portion of a nucleic acid sequence. Suitable fragments useful in the methods of the present invention
are functional fragments, which retain at least one of the functional parts of the promoter and hence are still capable of
driving and/or regulating expression. Examples of functional fragments of a promoter include the minimal promoter, the
upstream regulatory elements, or any combination thereof.

[0024] Suitable fragments may range from at least about 20 base pairs or about 50, 100, 150, 200, 250, 300, 350,
400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950 or 1000 base pairs, up to about the full length sequence of
the invention. These base pairs are typically immediately upstream of the transcription initiation start, but alternatively
may be from anywhere in the promoter sequence.
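
As a purely illustrative sketch, a series of candidate fragments lying immediately upstream of the transcription initiation start may be generated as shown below. It is assumed here that the promoter sequence is written 5' to 3' and that its 3' end abuts the transcription initiation start; the function name and the default fragment lengths are hypothetical and chosen only for the example.

```python
def upstream_fragments(promoter, lengths=(20, 50, 100, 150, 200)):
    """Return {length: fragment} for a 5' deletion series.

    Each fragment consists of the 3'-most `length` bases of the promoter,
    i.e. the bases assumed to lie immediately upstream of the transcription
    initiation start. Lengths exceeding the promoter length are skipped.
    """
    return {n: promoter[-n:] for n in lengths if 0 < n <= len(promoter)}


# Usage with a placeholder sequence (not one of SEQ ID NO 1 to 22).
placeholder = "A" * 30 + "C" * 20
for length, fragment in upstream_fragments(placeholder, lengths=(20, 50, 100)).items():
    print(length, fragment)
```

Whether any such fragment is still capable of driving and/or regulating expression is not determined by this sketch and would have to be tested experimentally.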

[0025] Suitable fragments useful in the methods of the present invention may be tested for their capability of driving
and/or regulating expression by standard techniques well known to the skilled person, or by following the method
described in the Examples section.

[0026] The promoters as disclosed in any one of SEQ ID NO 1 to 22 are isolated as nucleic acids of approximately
1.2 kb from the upstream region of particular rice coding sequences (CDS). These nucleic acids may include typical
elements of a promoter, which are presented in Figure 1. Generally, a promoter may comprise, from the coding sequence
in the upstream direction: (i) a 5'UTR of pre-messenger RNA, (ii) a minimal promoter comprising the transcription
initiation element (INR) and, more upstream, a TATA box, and (iii) regulatory elements that determine the
specific expression pattern of the promoter.

[0027] The term "promoter" as used herein is taken in a broad context and refers to regulatory nucleic acid sequences
capable of effecting (driving and/or regulating) expression of the sequences to which they are operably linked. A "promoter"
encompasses transcriptional regulatory sequences derived from a classical genomic gene. Usually a promoter comprises
a TATA box, which is capable of directing the transcription initiation complex to the appropriate transcription initiation
start site. However, some promoters do not have a TATA box (TATA-less promoters), but are still fully functional for
driving and/or regulating expression. A promoter may additionally comprise a CCAAT box sequence and additional
regulatory elements (i.e. upstream activating sequences or cis-elements such as enhancers and silencers). A "promoter"
may also include the transcriptional regulatory sequences of a classical prokaryotic gene, in which case it may include
a -35 box sequence and/or a -10 box transcriptional regulatory sequence.
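
For illustration, a promoter sequence may be screened for the TATA box and CCAAT box mentioned above by simple pattern matching. In the sketch below, the pattern `TATA[AT]A[AT]` is an assumed, commonly cited consensus and is not taken from the present description; the function and dictionary names are likewise hypothetical. An absence of hits does not imply that a sequence is non-functional, since TATA-less promoters exist.

```python
import re

# Assumed consensus patterns, for illustration only.
MOTIFS = {
    "TATA box": re.compile(r"TATA[AT]A[AT]"),
    "CCAAT box": re.compile(r"CCAAT"),
}


def find_promoter_elements(sequence):
    """Return {element name: [0-based start positions]} on the given strand."""
    seq = sequence.upper()
    return {name: [m.start() for m in pattern.finditer(seq)]
            for name, pattern in MOTIFS.items()}


# Usage with a placeholder sequence (not one of SEQ ID NO 1 to 22).
hits = find_promoter_elements("GGCCAATCGGTATAAATAGG")
print(hits)
```

Only the strand as written is scanned, and overlapping occurrences of the same pattern are not reported; both are simplifications of this sketch.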

[0028] "Driving expression" as used herein means promoting the transcription of a nucleic acid. 
[0029] "Regulating expression" as used herein means influencing the level, time or place of transcription of a nucleic 
acid. The promoters of the present invention may thus be used to increase, decrease or change in time and/or place 
transcription of a nucleic acid. For example, they may be used to limit the transcription to certain cell types, tissues or
organs, or during a certain period of time, or in response to certain environmental conditions.

[0030] The promoter is preferably a plant-expressible promoter. The term "plant-expressible" means being capable
of regulating expression in a plant, plant cell, plant tissue and/or plant organ. Accordingly, the invention encompasses
an isolated nucleic acid as mentioned above, capable of regulating transcription of an operably linked nucleic acid in a
plant or in one or more particular cells, tissues or organs of a plant.

[0031] The expression patterns of the promoters according to the present invention were studied in detail and it was
found that many of them were tissue-specific. Accordingly, the present invention provides "tissue-specific" promoters.
The term "tissue-specific" shall be taken to indicate that expression is predominantly in a particular tissue, tissue-type,
organ or any other part of the organism, albeit not necessarily exclusively in said tissue, tissue-type, organ or other part.
Accordingly, the invention encompasses an isolated nucleic acid as mentioned above, capable of driving and/or regulating
expression (of an operably linked nucleic acid) in a tissue-specific manner. Expression may be driven and/or regulated
in the seed, embryo, scutellum, aleurone, endosperm, leaves, flower, calli, meristem, shoot meristem, discriminating
centre, shoot and root. In grasses the shoot meristem is located in the so-called discrimination zone
from where the shoot and the leaves originate.

[0032] A tissue-specific promoter is one example of a so-called "regulated promoter". These promoters are regulated
by endogenous signals such as the presence of certain transcription factors, metabolites, plant hormones, or exogenous
signals, such as ageing, stresses or nutritional status. These regulations may have an effect on one or more different
levels such as spatial specificity or temporal specificity. Encompassed within the present invention is a nucleic acid as
described hereinabove, which is a "regulated promoter". Examples of regulated promoters are cell-specific promoters,
tissue-specific promoters, organ-specific promoters, cell cycle-specific promoters, inducible promoters or young
tissue-specific promoters.

[0033] Alternatively and/or additionally, some promoters of the present invention display a constitutive expression
pattern. Accordingly, the present invention provides a promoter as described hereinabove, which is a constitutive
promoter. The term "constitutive" means having no or very few spatial or temporal regulations. The term "constitutive
expression" as used herein refers to a substantially continuous expression in substantially all tissues of the organism.
The skilled craftsman will understand that a "constitutive promoter" is a promoter that is active during most, but not
necessarily all, phases of growth and development of the organism and throughout most, but not necessarily all, parts
of an organism.

[0034] The "expression pattern" of a promoter is not only influenced by the spatial and temporal aspects, but also by
the level of expression. The level of expression is determined by the so-called "strength" of a promoter. Depending on
the resulting expression level, a distinction is made herein between "weak" or "strong" promoters. Generally by "weak
promoter" is meant a promoter that drives expression of an operably linked nucleic acid at levels of about 1/10000
transcripts to about 1/100000 transcripts to about 1/500000 transcripts. Generally, by "strong promoter" is meant a
promoter that drives expression at levels of about 1/10 transcripts, to about 1/100 or to about 1/1000 transcripts.
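
The distinction between "weak" and "strong" promoters may be illustrated by the following sketch, which classifies a promoter from the fraction of all transcripts that derive from the operably linked nucleic acid. The function name, the use of the approximate values 1/1000 and 1/10000 as hard cut-offs, and the label "intermediate" for values between them are assumptions made for the example only; the ranges given above are approximate and do not define sharp boundaries.

```python
STRONG_MIN = 1 / 1000    # assumed cut-off: about 1/1000 transcripts or more
WEAK_MAX = 1 / 10000     # assumed cut-off: about 1/10000 transcripts or less


def promoter_strength(target_transcripts, total_transcripts):
    """Classify promoter strength from transcript counts.

    `target_transcripts` is the number of transcripts of the operably linked
    nucleic acid and `total_transcripts` the total number of transcripts in
    the sample (hypothetical inputs, e.g. from a quantified expression assay).
    """
    if total_transcripts <= 0:
        raise ValueError("total_transcripts must be positive")
    fraction = target_transcripts / total_transcripts
    if fraction >= STRONG_MIN:
        return "strong"
    if fraction <= WEAK_MAX:
        return "weak"
    return "intermediate"  # label assumed for this sketch only
```

For example, under these assumptions a promoter yielding 1 transcript in 100 would be reported as strong and one yielding 1 transcript in 100000 as weak.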
[0035] According to a particular embodiment, the invention provides an isolated promoter as mentioned hereinabove, 
which is a hybrid promoter. The term "hybrid promoter" as used herein refers to a chimeric promoter made, for example, 
synthetically, for example by genetic engineering. Preferred hybrid promoters according to the present invention comprise 
a part, preferably a functional part, of one of the promoters according to the present invention and at least another part, 
preferably a functional part, of a promoter. The latter part may be a part of any promoter, including any one of the
promoters according to the present invention and other promoters. One example of a hybrid promoter comprises regulatory
element(s) of a promoter according to the present invention combined with the minimal promoter of another
promoter. Another example of a hybrid promoter is a promoter comprising additional regulatory elements to further 
enhance its activity and/or to alter its spatial and/or temporal expression pattern. 

[0036] The present invention also provides use of a functional fragment of any one of SEQ ID NO 1 to 22 or variant 
thereof for changing the expression pattern of a promoter. In such methods, at least part of any of the nucleic acids
according to the present invention is combined with at least one fragment of another promoter.
[0037] Further, the invention provides a genetic construct comprising: 

(a) an isolated promoter as defined hereinabove;

(b) a heterologous nucleic acid sequence operably linked to the isolated promoter of (a); and optionally

(c) a 3' transcription terminator.

[0038] The term "genetic construct" as used herein means a nucleic acid made by genetic engineering. 
[0039] The term "operably linked" to a promoter as used herein means that the transcription is driven and/or regulated 
by that promoter. A person skilled in the art will understand that being operably linked to a promoter preferably means 
that the promoter is positioned upstream (i.e. at the 5'-end) of the operably linked nucleic acid. The distance to the 
operably linked nucleic acid may be variable, as long as the promoter of the present invention is capable of driving and/or 
regulating the transcription of the operably linked nucleic acid. For example, between the promoter and the operably
linked nucleic acid, there might be a cloning site, an adaptor, or a transcription or translation enhancer.
[0040] The operably linked nucleic acid may be any coding or non-coding nucleic acid. The operably linked nucleic 
acid may be in the sense or in the anti-sense direction. Typically in the case of genetic engineering of host cells, the 
operably linked nucleic acid is to be introduced into the host cell and is intended to change the phenotype of the host
cell. Alternatively, the operably linked nucleic acid is an endogenous nucleic acid from the host cell. 
[0041] The term "heterologous" as used herein is intended to be "heterologous to the promoter of the present invention".
A nucleic acid that is heterologous to the promoter of the present invention is not naturally occurring in the nucleic acid 
sequences flanking the promoter of the present invention when it is in its biological genomic environment. While the 
nucleic acid may be heterologous to the promoter of the present invention, it may be homologous or native or heterologous 
or foreign to the plant host cell. The heterologous operably linked nucleic acid may be any nucleic acid (for example 
encoding any protein), provided that it comprises or it is flanked by at least one nucleotide which is normally not flanking 
the promoter of the present invention.

[0042] The term "transcription terminator" as used in (c) refers to a DNA sequence at the end of a transcriptional unit
which signals termination of transcription. Terminators are 3'-non-translated DNA sequences usually containing a
polyadenylation signal, which facilitates the addition of polyadenylate sequences to the 3'-end of a primary transcript.
Terminators active in and/or isolated from viruses, yeasts, moulds, bacteria, insects, birds, mammals and plants are known
and have been described in literature. Examples of terminators suitable for use in the genetic constructs of the present
invention include the Agrobacterium tumefaciens nopaline synthase (NOS) gene terminator, the Agrobacterium
tumefaciens octopine synthase (OCS) gene terminator sequence, the Cauliflower mosaic virus (CaMV) 35S gene terminator
sequence, the Oryza sativa ADP-glucose pyrophosphorylase terminator sequence (t3'Bt2), the Zea mays zein gene
terminator sequence, the rbcs-1A gene terminator, and the rbcs-3A gene terminator sequences, amongst others.
[0043] The present invention also provides an expression cassette, a transformation vector or a plant expression 
vector comprising a genetic construct as described above. 

[0044] An "expression cassette" as meant herein refers to a minimal genetic construct necessary for expression of a 
nucleic acid. A typical expression cassette comprises a promoter-gene-terminator combination. An expression cassette
may additionally comprise cloning sites, for example Gateway™ recombination sites or restriction enzyme recognition
sites, to allow easy cloning of the operably linked nucleic acid or to allow the easy transfer of the expression cassette
into a vector. An expression cassette may further comprise 5' untranslated regions, 3' untranslated regions, a selectable
marker, transcription enhancers or translation enhancers.
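
The promoter-gene-terminator arrangement of a typical expression cassette may be modelled, purely by way of example, as a small data structure. The class name, the field names and the coordinate convention (0-based, end-exclusive) are hypothetical; optional components such as cloning sites, untranslated regions, selectable markers and enhancers are omitted from this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExpressionCassette:
    """Minimal model: promoter upstream (5') of the operably linked nucleic
    acid, followed by an optional 3' transcription terminator."""
    promoter: str
    operably_linked: str
    terminator: str = ""

    def sequence(self):
        """Concatenate the parts in 5' to 3' order."""
        return self.promoter + self.operably_linked + self.terminator

    def features(self):
        """Return (name, start, end) for each non-empty part."""
        parts = [("promoter", self.promoter),
                 ("operably linked nucleic acid", self.operably_linked),
                 ("terminator", self.terminator)]
        coords, position = [], 0
        for name, seq in parts:
            if seq:
                coords.append((name, position, position + len(seq)))
                position += len(seq)
        return coords


# Usage with placeholder sequences (not sequences of the invention).
cassette = ExpressionCassette("AAAA", "ATGCCC", "TTT")
print(cassette.sequence())
print(cassette.features())
```

The model places the promoter directly adjacent to the operably linked nucleic acid; in an actual construct the distance between the two may vary, as explained above for the term "operably linked".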

[0045] With "transformation vector" is meant a genetic construct, which may be introduced in an organism by
transformation and may be stably maintained in said organism. Some vectors may be maintained in for example Escherichia
coli, A. tumefaciens, Saccharomyces cerevisiae or Schizosaccharomyces pombe, while others such as phagemids and
cosmid vectors, may be maintained in bacteria and/or viruses. Transformation vectors may be multiplied in their host
cell and may be isolated again therefrom to be transformed into another host cell. Vector sequences generally comprise
a set of unique sites recognized by restriction enzymes, the multiple cloning site (MCS), wherein one or more non-vector
sequence(s) can be inserted. Vector sequences may further comprise an origin of replication which is required for
maintenance and/or replication in a specific host cell. Examples of origins of replication include, but are not limited to,
the f1-ori and colE1.

[0046] "Expression vectors" form a subset of transformation vectors, which, by virtue of comprising the appropriate
regulatory sequences, enable expression of the inserted non-vector sequence(s). Expression vectors have been
described which are suitable for expression in bacteria (e.g. E. coli), fungi (e.g. S. cerevisiae, S. pombe, Pichia pastoris),
insect cells (e.g. baculoviral expression vectors), animal cells (e.g. COS or CHO cells) and plant cells. One suitable
expression vector according to the present invention is a plant expression vector, useful for the transformation of plant
cells, the stable integration in the plant genome, the maintenance in the plant cell and the expression of the non-vector
sequences in the plant cell.

[0047] Typically, a plant expression vector according to the present invention comprises a nucleic acid of any one of
SEQ ID NO 1 to 22 or a variant thereof as described hereinabove, optionally operably linked to a second nucleic acid.
Typically, a plant expressible vector according to the present invention further comprises T-DNA regions for stable
integration into the plant genome (for example the left border and the right border regions of the Ti plasmid).

[0048] The genetic constructs of the invention may further comprise a "selectable marker". As used herein, the term
"selectable marker" includes any gene, which confers a phenotype to a cell in which it is expressed, to facilitate the
identification and/or selection of cells that are transfected or transformed. Suitable markers may be selected from markers
that confer antibiotic or herbicide resistance. Cells containing the genetic construct will thus survive antibiotics or herbicide
concentrations that kill untransformed cells. Examples of selectable marker genes include genes conferring resistance
to antibiotics (such as nptII encoding neomycin phosphotransferase capable of phosphorylating neomycin and kanamycin,
or hpt encoding hygromycin phosphotransferase capable of phosphorylating hygromycin), to herbicides (for example
bar which provides resistance to Basta; aroA or gox providing resistance against glyphosate), or genes that provide a
metabolic trait (such as manA that allows plants to use mannose as sole carbon source). Visual marker genes result in
the formation of colour (for example beta-glucuronidase, GUS), luminescence (such as luciferase) or fluorescence
(Green Fluorescent Protein, GFP, and derivatives thereof). Further examples of suitable selectable marker genes include
the ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), bacterial kanamycin resistance gene (Kanr),
phosphinothricin resistance gene, and the chloramphenicol acetyltransferase (CAT) gene, amongst others.
[0049] Furthermore, the present invention encompasses a host cell comprising an isolated promoter, or a genetic
construct, or an expression cassette, or a transformation vector or an expression vector according to the invention as
described hereinabove. In particular embodiments of the invention, the host cell is selected from bacteria, algae, fungi,
yeast, plants, insect or animal host cells.

[0050] In one particular embodiment, the invention provides a transgenic plant cell comprising an isolated promoter
according to the invention, or an isolated nucleic acid, or a genetic construct, or an expression cassette, or a transformation
vector or an expression vector according to the invention as described hereinabove. Preferably said plant cell is a dicot
plant cell or a monocot plant cell, more preferably a cell of any of the plants as mentioned herein. Preferably, in the
transgenic plant cell according to the invention, the promoter or the genetic construct of the invention is stably integrated
into the genome of the plant cell.

[0051] The invention also provides a method for the production of a transgenic plant, comprising:

(a) Introducing into a plant cell an isolated promoter, for example any one of SEQ ID NO 1 to SEQ ID NO 22, or a
variant or fragment thereof, or a genetic construct, or an expression cassette, or a transformation vector or an
expression vector according to the present invention and as described hereinabove, and

(b) Cultivating said plant cell under conditions promoting plant growth.

[0052] Introducing the above mentioned isolated promoter, or genetic construct, or expression cassette, or
transformation vector or expression vector, into a host cell (e.g. plant cell) is preferably achieved by transformation. The term
"transformation" as used herein encompasses the transfer of an exogenous polynucleotide into a host cell, irrespective
of the method used for transfer. In particular for plants, tissues capable of clonal propagation, whether by organogenesis
or embryogenesis, are suitable to be transformed with a genetic construct of the present invention and a whole plant
may be regenerated therefrom. The particular tissue chosen will vary depending on the clonal propagation systems
available for, and best suited to, the particular plant species being transformed. Exemplary tissue targets include leaf
disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g.,
apical meristem, axillary buds, and root meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl
meristem). The polynucleotide may be transiently or stably introduced into a plant cell and may be maintained
non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the plant genome.
[0053] Transformation of a plant species is now a fairly routine technique. Advantageously, any of several
transformation methods may be used to introduce the nucleic acids of the invention into a suitable ancestor cell. Transformation
methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA
directly into the plant, particle gun bombardment, transformation using viruses or pollen and microprojection. Methods
may be selected from the calcium/polyethylene glycol method for protoplasts (Krens, F.A. et al., 1982, Nature 296, 72-74;
Negrutiu I. et al., June 1987, Plant Mol. Biol. 8, 363-373); electroporation of protoplasts (Shillito R.D. et al., 1985,
Bio/Technol 3, 1099-1102); microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 179-185); DNA
or RNA-coated particle bombardment (Klein T.M. et al., 1987, Nature 327, 70); infection with (non-integrative) viruses
and the like. A preferred transformation method for the production of transgenic plant cells according to the present
invention is an Agrobacterium mediated transformation method.

[0054] Transgenic rice plants comprising any one of the promoters of the present invention are preferably produced
via Agrobacterium-mediated transformation using any of the well-known methods for rice transformation, such as the
ones described in any of the following: published European patent application EP 1198985 A1, Aldemita and Hodges
(Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 491-506, 1993); Hiei et al. (Plant J. 6 (2) 271-282,
1994); which disclosures are incorporated by reference herein as if fully set forth. In the case of corn transformation, the
preferred method is as described in either Ishida et al. (Nat. Biotechnol. 1996 Jun; 14(6): 745-50) or Frame et al. (Plant
Physiol. 2002 May; 129(1): 13-22), which disclosures are incorporated by reference herein as if fully set forth.

[0055] Generally after transformation, plant cells or cell groupings are selected for the presence of one or more markers
which are encoded by plant-expressible genes co-transferred with the gene of interest (which could be under the control
of any of the promoters of the present invention), following which the transformed material may be cultivated under
conditions promoting plant growth.
[0056] The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known
to persons skilled in the art. Accordingly, the method for the production of a transgenic plant as described hereinabove,
may further comprise regenerating a plant from said plant cell of (a).

[0057] The present invention further provides a plant comprising a plant cell as described hereinabove. The plants
may also be able to grow, or even reach maturity including for example fruit production, seed formation, seed ripening
and seed setting.

[0058] Furthermore, progeny may be produced from these seeds, which progeny may be fertile. Alternatively or
additionally, the transformed and regenerated plants may also produce progeny by non-sexual propagation such as
cloning or grafting. The generated transformed plants may be propagated by a variety of means, such as by clonal
propagation or classical breeding techniques. For example, a first generation (or T1) transformed plant may be selfed to give
homozygous second generation (or T2) transformants, and the T2 plants further propagated through classical breeding
techniques.
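
As a worked illustration of the selfing step, the sketch below enumerates the genotype classes expected among the selfed progeny. It assumes (the text does not state this) that the selfed plant carries a single hemizygous transgene insertion that segregates in a Mendelian fashion; the function name and the genotype labels "T" (transgene present) and "-" (no insertion) are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def selfing_ratios(parent=("T", "-")):
    """Expected genotype fractions after selfing a plant with the two given alleles.

    Assumption: one locus, both gametes drawn independently and with equal
    probability from the parent's two alleles.
    """
    # Every combination of one maternal and one paternal allele; the two
    # alleles are sorted so that "T-" and "-T" count as the same genotype.
    counts = Counter(
        "".join(sorted(pair, reverse=True)) for pair in product(parent, repeat=2)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


ratios = selfing_ratios()
for genotype, fraction in ratios.items():
    print(genotype, fraction)
```

Under these assumptions only a quarter of the selfed progeny are homozygous for the insertion, which is why homozygous transformants have to be identified among the progeny rather than assumed.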

[0059] The generated transformed organisms may take a variety of forms. For example, they may be chimeras of
transformed cells and non-transformed cells; clonal transformants (e.g., all cells transformed to contain the expression
cassette); grafts of transformed and untransformed tissues (e.g., in plants, a transformed rootstock grafted to an
untransformed scion).

[0060] Following DNA transfer and growth of the transformed cells, putatively transformed plant cells or plants may
be evaluated, for instance using Southern analysis, for the presence of the gene of interest, copy number and/or genomic
organization. Alternatively or additionally, expression levels or expression patterns of the newly introduced DNA may
be evaluated using northern and/or Western analysis, both techniques being well known to persons having ordinary
skill in the art.

[0061] The present invention clearly extends to plants obtainable by any of the methods according to the present
invention, which plants comprise any of the isolated promoters or the constructs of the present invention. The present
invention clearly extends to any plant parts and propagules of such plant. The present invention extends further to
encompass the progeny of a primary transformed cell, tissue, organ or whole plant that has been produced by any of
the aforementioned methods, the only requirement being that progeny exhibit the same genotypic and/or phenotypic
characteristic(s) as those produced in the parent by the methods according to the invention. The invention also extends
to harvestable parts of a plant, such as but not limited to seeds, leaves, fruits, flowers, stem cultures, stem, rhizomes,
roots, tubers, bulbs and cotton fibers.

[0062] The term "plant" or "plants" as used herein encompasses whole plants, ancestors and progeny of plants and
plant parts, including seeds, shoots, stems, roots (including tubers), and plant cells, tissues and organs. The term "plant"
therefore also encompasses suspension cultures, embryos, meristematic regions, callus tissue, gametophytes,
sporophytes, pollen, and microspores. Plants that are particularly useful in the methods of the invention include all plants
which belong to the superfamily Viridiplantae, in particular monocotyledonous and dicotyledonous plants including a
fodder or forage legume, ornamental plant, food crop, tree, or shrub selected from the list comprising Acacia spp., Acer
spp., Actinidia spp., Aesculus spp., Agathis australis, Albizia amara, Alsophila tricolor, Andropogon spp., Arachis spp,
Areca catechu, Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp., Bruguiera gymnorrhiza,
Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp, Camellia sinensis, Canna indica, Capsicum spp.,
Cassia spp., Centroema pubescens, Chaenomeles spp., Cinnamomum cassia, Coffea arabica, Colophospermum
mopane, Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., Cyathea dealbata,
Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea dealbata, Cydonia oblonga, Dalbergia monetaria,
Davallia divaricata, Desmodium spp., Dicksonia squarosa, Diheteropogon amplectens, Dioclea spp, Dolichos spp.,
Dorycnium rectum, Echinochloa pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrina spp.,
Eucalyptus spp., Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feijoa sellowiana, Fragaria spp., Flemingia spp,
Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine javanica, Gliricidia spp, Gossypium hirsutum, Grevillea
spp., Guibourtia coleosperma, Hedysarum spp., Hemarthia altissima, Heteropogon contortus, Hordeum vulgare,
Hyparrhenia rufa, Hypericum erectum, Hyperthelia dissoluta, Indigo incarnata, Iris spp., Leptarrhena pyrolifolia, Lespediza
spp., Lettuca spp., Leucaena leucocephala, Loudetia simplex, Lotonus bainesii, Lotus spp., Macrotyloma axillare,
Malus spp., Manihot esculenta, Medicago sativa, Metasequoia glyptostroboides, Musa sapientum, Nicotianum spp.,
Onobrychis spp., Ornithopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, Petunia
spp., Phaseolus spp., Phoenix canadensis, Phormium cookianum, Photinia spp., Picea glauca, Pinus spp., Pisum
sativum, Podocarpus totara, Pogonarthria fleckii, Pogonarthria squarrosa, Populus spp., Prosopis cineraria,
Pseudotsuga menziesii, Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphiolepsis umbellata, Rhopalostylis sapida,
Rhus natalensis, Ribes grossularia, Ribes spp., Robinia pseudoacacia, Rosa spp., Rubus spp., Salix spp.,
Schyzachyrium sanguineum, Sciadopitys verticillata, Sequoia sempervirens, Sequoiadendron giganteum, Sorghum bicolor,
Spinacia spp., Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagi spp, Taxodium distichum,
Themeda triandra, Trifolium spp., Triticum spp., Tsuga heterophylla, Vaccinium spp., Vicia spp., Vitis vinifera, Watsonia
pyramidata, Zantedeschia aethiopica, Zea mays, amaranth, artichoke, asparagus, broccoli, brussel sprout, cabbage,
canola, carrot, cauliflower, celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice, soybean, straw,
sugarbeet, sugarcane, sunflower, tomato, squash, and tea, trees and algae amongst others. According to a preferred
feature of the present invention, the plant is a crop plant such as soybean, sunflower, canola, alfalfa, rapeseed, cotton,
tomato, potato, tobacco, squash, papaya, poplar, leguminosa, flax, lupinus or sorghum. According to another preferred
embodiment of the present invention the plant is a monocotyledonous plant, such as sugarcane, further preferably a
cereal such as rice, maize, wheat, barley, millet, rye or oats.

[0063] The invention further provides a method for driving and/or regulating expression of a nucleic acid in a plant or 
plant cell, comprising: 

a) Operably linking a nucleic acid to an isolated nucleic acid according to the invention as described hereinabove,
such as to any one of SEQ ID NO 1 to 22 or a variant or fragment thereof, and

b) Introducing the resultant genetic construct into a plant or plant cell. 

[0064] Preferably the operably linked nucleic acid of (a) is heterologous to the nucleic acids according to the present 
invention. 

[0065] This method may further comprise cultivating the transformed plant or plant cell under conditions promoting 
growth, promoting regeneration and/or promoting maturation. 

[0066] Furthermore, the expression of the operably linked nucleic acid may be driven and/or regulated in particular
cells, tissues or organs of a plant. Accordingly, the invention provides a method as described above, wherein the
expression is constitutive expression or tissue-specific expression. For these embodiments, reference is made to the
example section where the specific expression patterns of the promoters according to the invention are described and
where different types of tissue-specific expression are detailed.

[0067] The present invention further encompasses the use of an isolated nucleic acid as defined hereinabove to drive
and/or regulate expression of an operably linked nucleic acid.

[0068] The person skilled in the art will recognize that provision of sequences SEQ ID NO 1 to 22 readily makes
available the tools to isolate related promoters, which may have substantial sequence identity to any of SEQ ID NO
1 to 22. Additionally, provision of sequences SEQ ID NO 23 to 44 (CDS corresponding to the promoters of the present
invention, see Table 1) readily makes available the tools to isolate related promoters, of which the related CDSs may
have substantial sequence identity to any of SEQ ID NO 23 to 44. Therefore the present invention also encompasses
a method for isolating nucleic acids, capable of driving and/or regulating expression of an operably linked nucleic acid,
comprising screening a nucleic acid sequence database to find homologues of any of the sequences represented by
SEQ ID NO 1 to 22 or SEQ ID NO 23 to 44. Subsequently these homologues are used to screen a library with genomic
DNA, which library is for example prepared from the organism of origin of the above mentioned homologue. The screening
procedure may for example involve hybridization. Subsequently, the genomic DNA that matches the homologue is
analysed to identify the transcription initiation site and the translation initiation site of the gene corresponding to the
homologue. Finally, specific primers are designed for amplification of a nucleic acid located in the region upstream (at
the 5' end) of said translation initiation site.

[0069] The present invention extends to the identification of regulatory proteins that are involved in the regulation of
the activity of the promoters according to the present invention. Such identification may be achieved using a yeast
one-hybrid system. In such a yeast one-hybrid system the sequences according to any one of SEQ ID NO 1 to 22 are operably
linked to the GAL transcription activator and transformed to a yeast cell culture. That yeast cell culture is again transformed
with a library of constructs encoding candidate regulatory factors.

[0070] The present invention will now be described with reference to the following figures in which: 

Figure 1 shows a general schematic representation of a promoter. Regulatory elements are sequences that may
for example be responsible for spatial and/or temporal regulation of the promoter activity. The minimal promoter is
the minimal sequence necessary and sufficient to drive expression. It includes a TATA box, which is necessary to
correctly direct the RNA polymerase II to the transcription initiation site. The transcription initiation element (INR)
includes the transcription initiation start site. The 5' untranslated region (5'UTR) is the region that is transcribed into
pre-messenger RNA and eventually into mRNA, but is not translated into protein. The translation initiation codon is
represented by the start codon ATG.

Figure 2 is a map of the vector p4581 useful for expression in plants of a β-glucuronidase (GUS) gene under control
of any one of the promoters according to the invention. This binary vector comprises a Gateway recombination
cassette, suitable for the recombination cloning of any of the promoters of the present invention in front of the
Escherichia coli β-glucuronidase (GUS) gene. This cassette contains a chloramphenicol resistance gene (CamR)
and the ccdB suicide gene for counter selection of non-recombined plasmids. This GUS expression cassette further
comprises the double terminator sequence T-zein and T-rbcS-deltaGA. This expression cassette is located within
the left border (LB repeat, LB Ti C58) and the right border (RB repeat, RB Ti C58) of the nopaline Ti plasmid. Cloned
within these borders are also a selectable marker gene and a screenable marker gene, each under control of a constitutive
promoter and a terminator sequence. This vector also contains an origin of replication (pBR322) for bacterial
replication and a bacterial selectable marker (Spe/SmeR) for bacterial selection.

The following figures show the results of the GUS staining of plants or plant parts transformed with the reporter
vector p4581 carrying a promoter according to the present invention operably linked to the reporter gene GUS.

Plants denoted "C plants" are transgenic plants grown to about 5 cm; plants denoted "B plants" are grown to about
10 cm; and plants denoted "A plants" are grown to maturity. These A plants were used to collect different tissue
samples from old leaves, young leaves and seeds.

Figure 3 shows the expression pattern of PRO0110 (RCc3, SEQ ID NO 1). GUS staining is visible in roots.

Figure 4 shows the expression pattern of PRO0005 (putative beta-amylase, SEQ ID NO 2). GUS staining is visible
in seeds, more specifically in the embryo or in the scutellum of the embryo.

Figure 5 shows the expression pattern of PRO0009 (putative cellulose synthetase, SEQ ID NO 3). GUS staining
is visible in roots.

Figure 6 shows the expression pattern of PRO0058 (proteinase inhibitor Rgpi9, SEQ ID NO 4). GUS staining is
visible in the seeds.

Figure 7 shows the expression pattern of PRO0061 (beta expansine EXPB9, SEQ ID NO 5). GUS staining is visible
in young flowers of A plants (A) and in other young expanding tissues of B plants (B) and C plants (C).

Figure 8 shows the expression pattern of PRO0063 (putative structural protein, SEQ ID NO 6). GUS staining is
visible in young tissues, for example in the calli (A) or old leaves, young leaves and seeds of "A plants" (B).

Figure 9 shows the expression pattern of PRO0081 (putative caffeoyl-CoA 3-O-methyltransferase, SEQ ID NO 7).
GUS staining is visible in young tissues, particularly of the shoot.

Figure 10 shows the expression pattern of PRO0091 (prolamine RP5, SEQ ID NO 8). GUS staining is visible in
seeds (A), particularly in the endosperm, and in meristem (B).

Figure 11 shows the expression pattern of PRO0095 (putative amino peptidase, SEQ ID NO 9). GUS staining is
visible in seeds, more particularly in the embryo.

Figure 12 shows the expression pattern of PRO0111 (uclacyanin 3-like protein, SEQ ID NO 10). GUS staining is
visible in roots and in meristem.

Figure 13 shows the expression pattern of PRO0116 (26S proteasome regulatory particle non-ATPase subunit 11,
SEQ ID NO 11). GUS staining is weakly visible in the whole plant (weak constitutive) and is particularly visible in
meristem.

Figure 14 shows the expression pattern of PRO0117 (putative 40S ribosomal protein, SEQ ID NO 12). GUS staining
is visible in the seeds, more particularly in the endosperm.

Figure 15 shows the expression pattern of PRO0122 (chlorophyll a/b-binding protein precursor (Cab27), SEQ ID
NO 13). GUS staining is visible in the shoot.

Figure 16 shows the expression pattern of PRO0123 (putative protochlorophyllide reductase, SEQ ID NO 14). GUS
staining is visible in the shoot (above-ground tissues).

Figure 17 shows the expression pattern of PRO0133 (chitinase Cht-3, SEQ ID NO 15). GUS staining is visible in
the roots and meristem.

Figure 18 shows the expression pattern of PRO0151 (WSI18, SEQ ID NO 16). GUS staining is visible in the calli
and upper plant parts (A) as well as in the aleurone layer and embryo (B).

Figure 19 shows the expression pattern of PRO0169 (aquaporine, SEQ ID NO 17). GUS staining is visible in the
whole plant (constitutive expression).

Figure 20 shows the expression pattern of PRO0170 (High mobility group protein, SEQ ID NO 18). GUS staining
is strongly visible in the whole plant as is illustrated by the "B plants" (A), and various tissues such as old leaves,
young leaves and seeds (B) and calli (C) (constitutive expression).

Figure 21 shows the expression pattern of PRO0171 (reversibly glycosylated protein RGP1, SEQ ID NO 19). GUS
staining is visible in all plant parts (constitutive expression).

Figure 22 shows the expression pattern of PRO0173 (cytosolic MDH, SEQ ID NO 20). GUS staining is visible in
all plant parts and particularly in the shoot (above-ground tissues) and seeds.

Figure 23 shows the expression pattern of PRO0175 (RAB21, SEQ ID NO 21). GUS staining is weakly visible in
calli (A), meristems and young leaves, and is strongly visible in developing and maturing seeds (B), more particularly
in the embryo.

Figure 24 shows the expression pattern of PRO0177 (Cdc2-1, SEQ ID NO 22). GUS staining is weakly visible in
meristem and in leaf sheaths.

Examples 


[0071] The promoters according to the present invention were isolated as DNA regions spanning about 1.2 kb of the
sequence upstream of the translation initiation codon (i.e. first ATG, which codon was excluded) from various rice genes.
For determination of their nucleic acid sequence and their expression pattern, the following procedure was followed:
First, in silico studies on genomic rice sequences were performed. However, procedures based on automated prediction
programs to locate promoter-like nucleic acid sequences are highly error prone, even for the localization of the
best-characterized promoter control elements such as the TATA box and the transcription initiation element (INR). Also, in silico
determination of expression pattern is extremely speculative. Therefore, to obtain unambiguous data about the nucleic
acid sequence and the expression pattern of the promoters, in vivo studies were performed encompassing (i) isolation
of the promoter nucleic acid sequence; (ii) operably linking a reporter gene to the promoter and introducing the resulting
genetic construct into a host organism; (iii) growing the transformed host cell under conditions allowing expression of
the reporter gene, and (iv) determination of the reporter gene activity in the different tissues of the host organism. These
methods are now described in more detail.


Example 1. Identification and Isolation of the promoters 

Identification of rice ESTs, the corresponding genes and their location in the rice genome 

[0072] Sequence databases, comprising rice sequences, were searched for rice expressed sequence tags (ESTs).
Subsequently an "in silico" Northern blot was performed to allow identification of EST families that are strongly expressed
or that are specific for a particular organ. This analysis included normalization of the numbers of ESTs isolated from
different plant organs. The EST families with an interesting distribution among source cDNA libraries were selected for
further analysis and sequence homology searches. After sequence homology searches in combination with scanning
scientific data, the genes that correspond to those families of ESTs were identified from sequence databases and a
(putative) function and corresponding gene name was given (see Table 1). Subsequently, the corresponding promoter
region was isolated by the following procedure. In a first step the TIGR database was searched to find a tentative contig
corresponding to an EST family. Sequence homology was found using standard computer programs, such as Blast N
using standard parameters (typically G Cost to open a gap = 5, E Cost to extend a gap = 2, q Penalty for a mismatch
in the blast portion of run = -3, r Reward for a match in the blast portion of run = 1, e Expectation value = 10.0, W Word
size = 11, v Number of one-line descriptions = 100, b Number of alignments to show = 100, Matrix = BLOSUM62). The
TIGR database (The Institute for Genomic Research) provides Tentative Contigs (TC), which are sequence predictions
based on contig building from all known EST, from all known cDNA and from reconstructed mRNA. The TCs used for
identification of the promoters of the present invention are represented in Table 1. In a second step these TCs were
used to locate the corresponding gene on a genomic sequence, which gene comprises the coding region as well as the
promoter region. Generally, these genomic sequences were BAC clones, which are represented herein by their Genbank
accession number (see Table 1). From these BAC clones the sequence identity of the promoter region could be
determined.
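
The normalization step of the "in silico" Northern analysis described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used: the text does not specify the normalization formula, so the sketch simply scales the EST count of one family to a fixed number of ESTs sequenced per library. The function names, library names and counts are hypothetical.

```python
def normalized_expression(family_counts, library_sizes, per=10_000):
    """Number of ESTs of one family per `per` ESTs sequenced in each cDNA library.

    family_counts: {library name: ESTs of this family found in that library}
    library_sizes: {library name: total ESTs sequenced from that library}
    """
    return {
        library: per * family_counts.get(library, 0) / size
        for library, size in library_sizes.items()
    }


def organ_share(normalized):
    """Fraction of the normalized signal contributed by each library."""
    total = sum(normalized.values())
    if total == 0:
        return {library: 0.0 for library in normalized}
    return {library: value / total for library, value in normalized.items()}


# Hypothetical example: one EST family counted in three organ libraries.
library_sizes = {"root": 20_000, "leaf": 50_000, "seed": 10_000}
family_counts = {"root": 40, "leaf": 10}

norm = normalized_expression(family_counts, library_sizes)
share = organ_share(norm)
for library in library_sizes:
    print(f"{library}: {norm[library]:.1f} per 10,000 ESTs, share {share[library]:.2f}")
```

A family whose normalized signal comes almost entirely from one library would be a candidate for organ-specific expression, while a high signal in every library would suggest strong constitutive expression.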
[0073] Table 1: List of rice promoters of the present invention. The promoter sequences are represented herein by
their SEQ ID NO and promoter number (PRO). The coding sequences (CDS) naturally driven by a promoter of the
present invention are represented by their name, by SEQ ID NO and by Tentative contig (TC) accession number of
the TIGR database. The Genomic sequences (BAC clones or genes) comprising a promoter region of the present
invention are represented by their Genbank accession number.

Prom SEQ ID NO  Prom number  CDS name                                                  CDS SEQ ID NO  CDS TC       BAC clone (*or gene)
1               PRO0110      RCc3                                                      23             TC89946      AC037426
2               PRO0005      putative beta-amylase                                     24             TC90358      AC022457
3               PRO0009      putative cellulose synthase                               25             TC83635      AC022457
4               PRO0058      proteinase inhibitor RgpI9                                26             TC83117      AF044059
5               PRO0061      beta expansine EXPB9                                      27             TC89913      AC020666
6               PRO0063      structural protein                                        28             TC89985      AP001278
7               PRO0081      putative caffeoyl-CoA 3-O-methyltransferase               29             TC89891      AP000364
8               PRO0091      prolamine RP5                                             30             TC89670      AF156714*
9               PRO0095      putative methionine aminopeptidase                        31             TC89883      AC027133
10              PRO0111      uclacyanin 3-like protein                                 32             [illegible]  [illegible]
11              PRO0116      26S proteasome regulatory particle non-ATPase subunit 11  33             TC83072      AP000969
12              PRO0117      putative 40S ribosomal protein                            34             [illegible]  [illegible]
13              PRO0122      chlorophyll a/b-binding protein precursor (Cab27)         35             TC82936      AP004700
14              PRO0123      putative protochlorophyllide reductase                    36             TC89839      AL606456
15              PRO0133      chitinase Cht-3                                           37             TC85888      D16223*
16              PRO0151      WSI18                                                     38             TC84300      AP003023
17              PRO0169      aquaporine                                                39             TC89687      AP005108
18              PRO0170      High mobility group protein                               40             TC89846      AP004004
19              PRO0171      reversibly glycosylated protein RGP1                      41             TC82935      AC090874
20              PRO0173      cytosolic MDH                                             42             TC82977      AC037425
21              PRO0175      RAB21                                                     43             TC83646      Y00842*
22              PRO0177      Cdc2-1                                                    44             TC90619      AP004765

Identification and isolation of the promoter regions of rice genes



[0074] Starting from the sequence information of the genes and their location in the rice genome, the promoter regions
of these genes were isolated as the DNA region spanning about 1.2 kb upstream of the translation initiation codon (i.e.
first ATG), which codon was excluded. When an intervening sequence, such as an intron, was present in the 5' untranslated
region of the gene, the isolated DNA region was taken as the region spanning about 1.2 kb plus the length of that
intervening sequence. The promoter regions were isolated from genomic DNA of Oryza sativa Japonica or exceptionally
from Oryza sativa Indica via PCR using specific primers. These specific primers comprise AttB recombination sites,
suitable for recombination cloning of the isolated promoter region. These specific primers are herein represented as SEQ
ID NO 45 to 88 and are listed in Table 2. Conditions for PCR were as follows: 1 cycle of 2 min at 94°C, 35 cycles of 1
min at 94°C, 1 min at 58°C and 2 min at 68°C, and 1 cycle of 5 min at 68°C. The length of the expected PCR fragment
is also indicated in Table 2. The corresponding PCR fragment was purified from the PCR reaction mix via gel
electrophoresis and subsequent purification using the Zymoclean Gel DNA Recovery Kit (Zymo Research, Orange, California).
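The delimitation rule described above (about 1.2 kb upstream of the first ATG, the codon itself excluded, extended by the length of any intervening sequence in the 5' untranslated region) can be illustrated with a minimal Python sketch. The function name, its parameters and the toy sequence are hypothetical and serve only to illustrate the coordinate arithmetic; they are not part of the described procedure, in which the regions were obtained by PCR.

```python
def promoter_region(genomic: str, atg_index: int,
                    upstream_bp: int = 1200, utr_intron_len: int = 0) -> str:
    """Return the region upstream of the first ATG, excluding the codon itself.

    genomic        -- genomic DNA sequence as a string (hypothetical input)
    atg_index      -- 0-based index of the 'A' of the translation initiation codon
    upstream_bp    -- nominal promoter length (about 1.2 kb in the text)
    utr_intron_len -- length of an intervening sequence in the 5' UTR, if any
    """
    if genomic[atg_index:atg_index + 3].upper() != "ATG":
        raise ValueError("atg_index does not point at an ATG codon")
    # Extend the window by the intron length; clamp at the sequence start.
    start = max(0, atg_index - (upstream_bp + utr_intron_len))
    # Slicing up to atg_index leaves the start codon out of the region.
    return genomic[start:atg_index]


# Toy example: 2000 bp of filler followed by a start codon.
toy = "A" * 1500 + "C" * 500 + "ATGGCC"
print(len(promoter_region(toy, atg_index=2000)))                      # 1200
print(len(promoter_region(toy, atg_index=2000, utr_intron_len=300)))  # 1500
```

The actual fragment lengths listed in Table 2 deviate from exactly 1200 bp, which is consistent with the "about 1.2 kb" wording and with primer placement.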


Table 2: Overview of the primers used to isolate the rice promoters of the present invention and the length of the
rice promoter regions.

Promoter    Promoter   Prom     Primer forward   Primer    Primer reverse   Primer
SEQ ID NO   number     length   SEQ ID NO        forward   SEQ ID NO        reverse

1           PRO0110    1264     45               prm3780   67               prm3781
2           PRO0005    1215     46               prm2768   68               prm2769
3           PRO0009    1038     47               prm2420   69               prm2421
4           PRO0058    1301     48               prm2853   70               prm2854
5           PRO0061    1243     49               prm2426   71               prm2427
6           PRO0063    1019     50               prm2855   72               prm2856
7           PRO0081    1212     51               prm3025   73               prm3026
8           PRO0091    1052     52               prm3029   74               prm3030
9           PRO0095    1216     53               prm3061   75               prm3062
10          PRO0111    1237     54               prm3031   76               prm3032
11          PRO0116    1100     55               prm3051   77               prm3052
12          PRO0117    1216     56               prm3592   78               prm3049
13          PRO0122    1210     57               prm5131   79               prm2195
14          PRO0123    123      58               prm3782   80               prm2197
15          PRO0133    1808     59               prm2844   81               prm2845
16          PRO0151    1828     60               prm2973   82               prm2974
17          PRO0169    1267     61               prm3770   83               prm3771
18          PRO0170    1130     62               prm3772   84               prm3773
19          PRO0171    1230     63               prm3774   85               prm3775
20          PRO0173    1234     64               prm3776   86               prm3777
21          PRO0175    1553     65               prm3800   87               prm3801
22          PRO0177    1087     66               prm5135   88               prm5136


Example 2. Cloning of promoter-GUS reporter vectors for plant transformation 



[0075] The purified PCR fragments of Example 1, corresponding to the promoter regions of the present invention,
were cloned into the pDONR201 entry plasmid of the Gateway™ system (Life Technologies) using the "BP recombination
reaction". The identity and base pair composition of the cloned insert were confirmed by sequencing and additionally,
the resulting plasmid was tested via restriction digests.

[0076] In order to clone each of the promoters of the present invention in front of a reporter gene, each entry clone of
Example 1 was subsequently used in an "LR recombination reaction" (Gateway™) with the destination vector p4581.
This destination vector was designed to operably link each promoter of the present invention to the Escherichia coli
beta-glucuronidase (GUS) gene via the substitution of the Gateway recombination cassette in front of the GUS gene.
Furthermore this destination vector is suitable for transformation of plants and comprises within the T-DNA left and right
borders the resulting promoter-GUS cassette and selectable marker and screenable marker cassettes (see Figure 2).
The resulting reporter vectors, comprising a promoter of the present invention operably linked to GUS, are subsequently
transformed into Agrobacterium strain LBA4044 and subsequently into rice plants using standard transformation
techniques.

Example 3. Expression patterns of the promoter-GUS reporter cassette in plants

Growth and harvest of transgenic plants or plant parts at various stages (C plants, B plants and A plants)

[0077] For each promoter-GUS reporter construct, 3 T0 transgenic rice plants were generated from transformed cells.
Plant growth was performed under normal conditions. The first transgenic plant was sacrificed for GUS staining when
it had reached a size of about 5 cm, which plant is named herein "C plant". The second transgenic plant was sacrificed
for GUS staining when it had reached a size of about 10 cm, which plant is named herein "B plant". The third transgenic
plant was kept for seed production and is named herein "A plant". GUS staining was performed on complete C and B
plants. On A plants, GUS staining was performed on leaf pieces, flowers and sections of seeds at various developmental
stages. A plants were allowed to set seed, which seeds were used after harvest for confirmation of the expression pattern
in T1 plants.

GUS staining 

[0078] The sacrificed plants or plant parts were covered with 90% ice-cold acetone and incubated for 30 min at 4 °C.
After 3 washes of 5 min with Tris buffer [15,76 g Trizma HCl (Sigma T3253) + 2,922 g NaCl in 1 l bidi, adjusted to pH
7,0 with NaOH], the material was covered by a Tris/ferricyanate/X-Gluc solution [9,8 ml Tris buffer + 0,2 ml ferricyanate
stock (0,33 g potassium ferricyanate (Sigma P3667) in 10 ml Tris buffer) + 0,2 ml X-Gluc stock (26,1 mg X-Gluc (Europa
Bioproducts ML 113A) in 500 µl DMSO)]. Vacuum infiltration was applied for 15 to 30 minutes. The plants or plant parts
were incubated for up to 16 hours at 37 °C until development of blue colour was visible. The samples were washed 3
times for 5 minutes with Tris buffer. Chlorophyll was extracted in an ethanol series of 50%, 70% and 90% (each for 30
minutes).
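As a worked check of the buffer recipe above, the following Python sketch converts the stated masses into molar concentrations and computes the dilution of the two stocks in the staining solution. The molar masses (157.60 g/mol for Tris hydrochloride, 58.44 g/mol for NaCl) are assumed standard values and are not given in the text; the function and variable names are illustrative only.

```python
def molarity(mass_g: float, molar_mass_g_per_mol: float, volume_l: float) -> float:
    """Molar concentration (mol/l) of a solute dissolved to a given final volume."""
    return mass_g / molar_mass_g_per_mol / volume_l


# Assumed molar masses (not stated in the text).
TRIS_HCL_G_PER_MOL = 157.60
NACL_G_PER_MOL = 58.44

# Tris buffer: 15,76 g Trizma HCl + 2,922 g NaCl in 1 l.
tris_molar = molarity(15.76, TRIS_HCL_G_PER_MOL, 1.0)
nacl_molar = molarity(2.922, NACL_G_PER_MOL, 1.0)

# Staining solution: 9,8 ml Tris buffer + 0,2 ml ferricyanate stock + 0,2 ml X-Gluc stock.
total_ml = 9.8 + 0.2 + 0.2
stock_fraction = 0.2 / total_ml  # fraction of the final volume taken by each stock

print(f"Tris-HCl: {tris_molar:.3f} M, NaCl: {nacl_molar:.3f} M")
print(f"final volume: {total_ml:.1f} ml, each stock diluted {1 / stock_fraction:.0f}-fold")
```

Under these assumptions the recipe corresponds to a buffer of roughly 0.1 M Tris and 0.05 M NaCl, with each stock diluted about 51-fold in the final staining solution.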

Expression patterns of the promoters of the present invention 

[0079] The expression patterns of the rice promoters of the present invention are summarized in Table 3.



Table 3: expression patterns of the rice promoters of the present invention

PRO         Promoter   Promoter name                                        Expression pattern
SEQ ID NO   number

1           PRO0110    RCc3                                                 strong root
2           PRO0005    putative beta-amylase                                embryo (scutellum)
3           PRO0009    putative cellulose synthase                          weak in roots
4           PRO0058    proteinase inhibitor Rgpi9                           seed
5           PRO0061    beta expansine EXPB9                                 weak in young tissues
6           PRO0063    structural protein                                   young tissues + calli + embryo
7           PRO0081    putative caffeoyl-CoA 3-O-methyltransferase          shoot
8           PRO0091    prolamine RP5                                        meristem + strong in endosperm
9           PRO0095    putative methionine aminopeptidase                   embryo
10          PRO0111    uclacyanin 3-like protein                            weak meristem
11          PRO0116    26S proteasome reg. particle non-ATPase s.u. 11      weak meristem
12          PRO0117    putative 40S ribosomal protein                       weak in endosperm
13          PRO0122    chlorophyll a/b-binding protein precursor (Cab27)    weak in shoot
14          PRO0123    putative protochlorophyllide reductase               strong shoot specific
15          PRO0133    chitinase Cht-3                                      weak meristem specific
16          PRO0151    WSI18                                                calli + shoot + strong embryo
17          PRO0169    aquaporine                                           medium constitutive
18          PRO0170    High mobility group protein                          strong constitutive
19          PRO0171    reversibly glycosylated protein RGP1                 weak constitutive
20          PRO0173    cytosolic MDH                                        shoot and seed
21          PRO0175    RAB21                                                embryo
22          PRO0177    Cdc2-1                                               weak in meristem + strong seed
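For readers who wish to pick a promoter by desired expression pattern, the content of Table 3 can be held in a simple lookup structure. The following Python sketch is illustrative only: the dictionary transcribes the table, while the function name and the keyword-matching approach are assumptions and not part of the disclosure.

```python
# Expression patterns transcribed from Table 3 (promoter number -> pattern).
EXPRESSION_PATTERNS = {
    "PRO0110": "strong root",
    "PRO0005": "embryo (scutellum)",
    "PRO0009": "weak in roots",
    "PRO0058": "seed",
    "PRO0061": "weak in young tissues",
    "PRO0063": "young tissues + calli + embryo",
    "PRO0081": "shoot",
    "PRO0091": "meristem + strong in endosperm",
    "PRO0095": "embryo",
    "PRO0111": "weak meristem",
    "PRO0116": "weak meristem",
    "PRO0117": "weak in endosperm",
    "PRO0122": "weak in shoot",
    "PRO0123": "strong shoot specific",
    "PRO0133": "weak meristem specific",
    "PRO0151": "calli + shoot + strong embryo",
    "PRO0169": "medium constitutive",
    "PRO0170": "strong constitutive",
    "PRO0171": "weak constitutive",
    "PRO0173": "shoot and seed",
    "PRO0175": "embryo",
    "PRO0177": "weak in meristem + strong seed",
}


def promoters_matching(keyword: str) -> list[str]:
    """Return promoter numbers whose Table 3 pattern mentions the keyword."""
    needle = keyword.lower()
    return sorted(pro for pro, pattern in EXPRESSION_PATTERNS.items()
                  if needle in pattern.lower())


print(promoters_matching("constitutive"))  # ['PRO0169', 'PRO0170', 'PRO0171']
print(promoters_matching("root"))          # ['PRO0009', 'PRO0110']
```

A plain substring match is deliberately crude; it does not distinguish strong from weak activity, for which the detailed paragraphs below should be consulted.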



[0080] The following paragraphs describe the observed expression patterns of the promoters of the present invention
in more detail. The observations are based on the visual inspection of the GUS stained tissues as described above. It
is to be understood that for some promoters expression may be weak and that expression in certain tissues may only
be visible with very sensitive detection methods.

PRO0110 - SEQ ID NO 1 - RCc3

[0081] 1 construct (OS1432), which is a reporter vector as described in Example 2 comprising PRO0110, was
investigated. 25 calli, 14 C, 21 B plants and 21 A plants were analysed. There was no expression visible in calli, but strong
expression in roots of C plants (93%) and of B plants (81%) was observed. No expression in the shoots of A plants was
observed. Therefore the RCc3 promoter PRO0110 is suitable for strong expression in roots.
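The percentages quoted in this and the following paragraphs are not defined explicitly in the text. Assuming they denote the fraction of analysed plants (or calli) in which GUS staining was seen, they can be related to plant counts as in the minimal Python sketch below. The counts 13 and 17 are back-calculated illustrations under that assumption, not figures reported in the source.

```python
def percent_positive(n_positive: int, n_analysed: int) -> int:
    """Percentage of analysed samples showing GUS staining, rounded to a whole number."""
    if n_analysed <= 0:
        raise ValueError("n_analysed must be positive")
    return round(100 * n_positive / n_analysed)


# PRO0110: 14 C plants and 21 B plants were analysed.
# Hypothetical positive counts that would reproduce the reported percentages:
print(percent_positive(13, 14))  # 93 (reported for roots of C plants)
print(percent_positive(17, 21))  # 81 (reported for roots of B plants)
```

With the small sample sizes used here, a single plant shifts the percentage by several points, which is worth keeping in mind when comparing promoters.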

PRO0005 - SEQ ID NO 2 - putative beta-amylase

[0082] 1 construct (OS1365) was investigated. 28 calli, 24 B plants and 22 A plants were analysed. Occasional
expression in calli (7%) was observed as well as occasional weak expression in roots (4%) and shoots (12%) of B plants,
expression in the scutellum of embryos of A plants (43%) and occasional expression in leaves (5%) of A plants. This
promoter is therefore suitable for expression in embryo, more preferably in the scutellum of the embryo. This region of
the embryo is also referred to as the transfer layer of the embryo. This promoter may have some leakiness in other tissues.

PRO0009 - SEQ ID NO 3 - putative cellulose synthase

[0083] 1 construct (OS1461) was investigated. 20 calli, 20 C, 20 B plants and 20 A plants were analysed. Occasional
expression in calli (20%) was observed as well as weak expression in roots (55%) of C plants, occasional expression
in young leaves (10%) of C plants and weak expression in the roots (25%) of B plants. No expression in leaves of A or
B plants was observed. Therefore this promoter is suitable for expression in roots. This promoter may show some
leakiness in the leaves.

PRO0058 - SEQ ID NO 4 - proteinase inhibitor Rgpi9

[0084] 1 construct (OS1370) was investigated. 13 B plants and 12 A plants were analysed. No expression was observed
in B plants. In A plants, no expression was observed in the leaves, but there was strong expression in endosperm and
embryo (58-42%). Therefore, this promoter PRO0058 is suitable for expression in seeds.

PRO0061 - SEQ ID NO 5 - beta expansine EXPB9

[0085] 2 constructs (OS1441 and OS1460) were investigated. 20 calli, 32 C, 32 B plants and 32 A plants were analysed.
Weak expression was observed in the leaves of C and B plants. In A plants expression in the flowers was observed
(44%), more particularly in lemma of young spikelets. It was concluded that the promoter PRO0061 is suitable for
expression in young tissue, more preferably in young, developing or expanding tissue, more preferably in green tissue.

PRO0063 - SEQ ID NO 6 - putative structural protein

[0086] 1 construct (OS1446) was investigated. 13 calli, 13 C, 13 B plants and 12 A plants were analysed. In calli,
weak expression was detected (92%). In C plants, there was no expression in roots and there was weak expression in
some leaves (46%). In B plants, there was no expression in roots and weak expression in young tillers (78%) or young
leaves (54%), but no expression in old leaves. In A plants, there was occasional expression in young leaves (17%) and
expression in embryo and scutellum (42%). Therefore it was concluded that this promoter is active in the above-ground
tissues, such as leaf, stem and seed. These data demonstrate that the promoter is suitable for expression in calli and
in the shoot, and for expression in young tissues and seeds.

PRO0081 - SEQ ID NO 7 - putative caffeoyl-CoA 3-O-methyltransferase

[0087] 1 construct (OS1419) was investigated. 20 calli, 20 C, 20 B plants and 20 A plants were analysed. No expression
was observed in calli. Expression was observed in C plants, more particularly weak expression in root cylinder (40%)
and weak expression in young leaves (80%) and in old leaves. Expression was also observed in B plants, more particularly
weak expression in roots (25%) and weak expression in young leaves (60%). Expression was also observed in young
leaves (50%) of A plants. It was concluded that promoter PRO0081 is suitable for expression in above-ground tissues,
preferably in the shoot. This promoter may have some leakage of expression in roots.

PRO0091 - SEQ ID NO 8 - prolamine RP5

[0088] 1 construct (OS1558) was investigated. 12 C, 12 B plants and 12 A plants were analysed. Weak expression
was observed in the discrimination centre (50%) of C plants and in the discrimination centre (58%) of B plants. Strong
expression was observed in endosperm (55%) of A plants. This promoter was found to be useful for strong expression
in the endosperm, with leakiness in meristem, preferably the shoot meristem or discrimination centre.

PRO0095 - SEQ ID NO 9 - putative methionine aminopeptidase

[0089] 1 construct (OS1423) was investigated. 16 calli, 14 C, 14 B plants and 16 A plants were analysed. Some
expression was observed in root-tips (36%) of C plants and in the embryo (38%) of A plants, but not in endosperm of
A plants. It was concluded that PRO0095 is suitable for expression in embryo.

PRO0111 - SEQ ID NO 10 - uclacyanin 3-like protein

[0090] 1 construct (OS1421) was investigated. 22 calli, 21 C, 22 B plants and 21 A plants were analysed. Weak
expression was observed in the discrimination centre and meristems (77%) of B plants. It was concluded that promoter
PRO0111 is suitable for weak expression in the meristem, preferably in shoot meristem or discrimination centre.

PRO0116 - SEQ ID NO 11 - 26S proteasome regulatory particle non-ATPase subunit 11

[0091] 1 construct (OS1679) was investigated. 13 C, 14 B plants and A plants were analysed. Weak expression was
observed in meristem/discrimination centre of C plants (38%) and of B plants (71%) and in young leaf sheaths of C
plants (77%) and of B plants (21%). It was concluded that promoter PRO0116 is suitable for expression in meristem,
preferably in shoot meristem or discrimination centre.

PRO0117 - SEQ ID NO 12 - putative 40S ribosomal protein

[0092] 1 construct (OS1425) was investigated. 9 calli, 9 C, 9 B plants and 9 A plants were analysed. Occasional weak
expression was observed in roots (22%) and in young leaf blades (44%) of C plants. Expression was mainly observed
in endosperm (37%) of A plants. Therefore, promoter PRO0117 was found to be suitable for expression in endosperm
and may have some leakiness in young leaves.

PRO0122 - SEQ ID NO 13 - chlorophyll a/b-binding protein precursor (Cab27)

[0093] 1 construct (OS1675) was investigated. 38 calli, 38 C, 38 B plants and 15 A plants were analysed. Very weak
expression was observed in the discrimination centre and young leaf sheaths of C plants. It was concluded that this
promoter PRO0122 is suitable for weak expression in shoots.

PRO0123 - SEQ ID NO 14 - putative protochlorophyllide reductase

[0094] 1 construct (OS1433) was investigated. 21 calli, 18 C, 19 B plants and 18 A plants were analysed. Strong
expression was observed in shoots (33-66%) of C plants and B plants (63-79%). In B plants there was also occasional
expression in roots. In A plants, again strong expression in young leaves (73%) was observed, as well as occasional
expression in old leaves (39%). It was concluded that this promoter is suitable for strong expression in shoots, preferably
in leaves.

PRO0133 - SEQ ID NO 15 - chitinase Cht-3

[0095] 1 construct (OS1687) was investigated. 15 calli, 12 C, 16 B plants and 12 A plants were analysed. Weak
expression was observed in calli (66%) and in the discrimination centre/meristem (50%) of B plants. It was concluded
that promoter PRO0133 is suitable for weak expression in meristem, preferably in shoot meristem or discrimination centre.

PRO0151 - SEQ ID NO 16 - WSI18

[0096] 1 construct (OS1458) was investigated. 22 calli, 16 C, 16 B plants and 13 A plants were analysed. Strong
expression was observed in calli (91%) and weak expression in shoots of C plants (62%). In A plants there was very
strong expression in the aleurone layer and in the embryo (46%). It was concluded that promoter PRO0151 is suitable
for strong expression in calli and in seeds, more particularly in the aleurone layer and in the embryo of the seeds.

PRO0169 - SEQ ID NO 17 - aquaporine

[0097] 1 construct (OS1911) was investigated. 11 calli, 10 C plants, B plants and A plants were analysed. Some
expression (55%) was observed in calli and in roots (30%) of C plants. Furthermore, good expression was observed in
shoot tissues (80%) of C plants and in young leaves of B plants. It was concluded that this promoter is suitable for
constitutive expression, preferably constitutive in young plants.

PRO0170 - SEQ ID NO 18 - High mobility group protein

[0098] 1 construct (OS1434) was investigated. 23 calli, 21 C, 21 B plants and 14 A plants were analysed. Expression
was observed in calli (52%) and in roots (51%) of C plants. Moreover, strong expression was observed in young leaves
(81%) of C plants, in roots (86%) of B plants and in young leaves (86%) of B plants. In A plants there was strong
expression in young leaves (75%), old leaves (43%), embryo and aleurone but a weaker expression in endosperm
(82%). It was concluded that promoter PRO0170 is suitable for strong constitutive expression.

PRO0171 - SEQ ID NO 19 - reversibly glycosylated protein RGP1

[0099] 1 construct (OS1762) was investigated. 18 calli, 11 C and 13 B plants were analysed. Strong expression was
observed in calli (44%) and in all tissues (27%) of C plants. In all tissues of B plants (16%), expression was somewhat
weaker but most pronounced in the discrimination centres (46%). It was concluded that promoter PRO0171 is suitable
for constitutive expression.

PRO0173 - SEQ ID NO 20 - cytosolic MDH

[0100] 1 construct (OS1435) was investigated. 17 calli, 17 C, 17 B plants and 15 A plants were analysed. Occasional
expression (12%) was observed in calli and weak expression was observed in upper parts (24-69%) of C plants as well
as in young leaves (41%) of B plants. In A plants, expression in leaves (33%) was observed and strong expression in
seeds (38%), but not in the root. It was concluded that the promoter PRO0173 is suitable for expression in above-ground
tissues, especially for constitutive expression in the shoot and especially in the seeds.

PRO0175 - SEQ ID NO 21 - RAB21

[0101] 1 construct (OS1436) was investigated. 16 calli, 12 C, 15 B plants and 15 A plants were analysed. Expression
was observed in some calli (31%), in the discrimination centres (42%) of C plants and in young leaves (25-58%) of C
plants and of A plants (15%). Furthermore, very strong expression was observed in aleurone and embryo (60%) of A plants.
It was concluded that promoter PRO0175 is suitable for strong expression in calli and in seeds, more particularly in
developing/maturing seeds, more particularly in the aleurone layer and in the embryo of the seeds.

PRO0177 - SEQ ID NO 22 - Cdc2-1

[0102] 1 construct (OS1436) was investigated. 16 calli, 12 C, 15 B plants and 15 A plants were analysed. Expression
was observed in some of the calli (31%), in the discrimination centre (42%) of C plants, in young leaves (25-58%) of C
plants and occasionally in young leaves (15%) of A plants. Moreover, very strong expression was observed in aleurone
and embryo (60%) of seeds from A plants. It was concluded that this promoter is suitable for specific expression in
seeds, more particularly in developing/maturing seeds.

Example 4. Stability of the expression patterns of the promoters of the present invention in further generations

[0103] The above-mentioned analyses were performed on T0 plants originating from the transformed tissues. The
stability of promoter activity in the next generations or progeny plants of the original T0 plant, the so-called T1 and T2
plants, was evaluated as follows. The T0 plants transformed with the reporter constructs as mentioned in the above
paragraphs of Example 2 were grown until maturity (A plants), of which the seeds (T1 seeds) were harvested and sown
to generate progeny T1 plants. These plants were analysed as described above in Example 3 and the A T1 plants were
allowed to reach maturity and to set T2 seeds.

[0104] The expression pattern of the promoters of the present invention was studied in T0 plants, T1 seeds, T1 plants
and T2 seeds and in all the tissues (including seeds and seed tissues) as described in Example 3. The specific expression
patterns as reported from the T0 and T1 seeds and described in Example 3 were confirmed in the following T1 generation
and T2 seeds. It is concluded that the expression patterns of the promoters of the present invention are stably inherited
in plants of subsequent generations.

Example 5. Stability of expression patterns of the promoters of the present invention in other plants

[0105] The above-mentioned plant analyses were performed on rice plants. This choice was based on the practical
consideration that plant genetic engineering is most profitable for crop plants. Also in other crop plants, such as for
example Zea mays, the reporter constructs comprising the promoters according to the present invention are introduced
and transformed plants are evaluated as described hereinabove. The expression patterns of the promoters according to
the present invention are conserved among plants. Therefore, the promoters according to the present invention are also
suitable for driving and/or regulating expression of an operably linked nucleic acid in monocots, such as corn.
[0106] For many other purposes such as research and horticulture, (small) herbs are being genetically modified, which
involves the use of promoters. Therefore the reporter constructs comprising the promoters according to the present
invention are introduced into other plant species such as for example Arabidopsis thaliana and transformed plants are
evaluated as described hereinabove. The expression patterns of the promoters according to the present invention are
conserved among plants. Therefore, the promoters according to the present invention are also suitable for driving and/or
regulating expression of an operably linked nucleic acid in other plant species such as for example dicots, such as
Arabidopsis.

SEQUENCE LISTING

<110> CropDesign N.V.

<120> Rice promoters

<130> PF58322

<150> EP 03075331.3

<151> 2003-02-04

<160> 88

<170> PatentIn version 3.1



<210> 1

<211> 1264

<212> DNA

<213> Oryza sativa

<220>

<221> misc_feature

<223> PRO0110 - RCc3

<400> 1

tcgacgctac tcaagtggtg ggaggccacc gcatgttcca acgaagcgcc aaagaaagcc 60

ttgcagactc taargctatt agtcgcctag gatatttgga atgaaaggaa ccgcagagtt 120

tttcagcacc aagagcttcc ggtggctagt ctgatagcca aaattaagga ggatgccaaa 180

acatgggtct tggcgggcgc gaaacacctr eataggtggc ttacctttta acatgttcgg 240

gccaaaggcc ttgagacgg- aaagttttct atttgcgctt gcgcatgtac aattttattc 300

ctctattcaa tgaaattgg- ggctcactgg ttcattaaaa aaaaaagaat ctagcctgrt 360

^gggaagaag aggattttgr trrgtgagaga gagagagaga gagagagaga gagagagaga 420

gaaggaggag gaggattttc aggcttcgca ttgcccaacc tctgcttctg ttggcccaag 480

aagaatccca ggcgcccatg ggctggcagt ttaccacgga cctacctagc ctaccttagc 540

Latctaagcg ggccgaccta gtagccacgt gcctagtgta gattaaagtt gccgggccag 600

caggaagcea cgctgcaatg guatcttccc ctgtccttcg cgtacgtgaa aacaaaccca 660

ggtaagctta gaatcttctt gcccgttgga ctgggacacc caccaatccc accatgcccc 720

gatattcctc cggzctcggt tcatgtgatg tcctctcttg tgtgatcacg gagcaagcat 780

tcttaaacgg caaaagaaaa tcaccaactt gctcacgcag tcacgctgca ccgcgcgaag 840

cgacgcccga taggccaaga tcgcgagata aaataacaac caatgatcat aaggaaacaa 900

gcccgcgatg tgtcgtgtgc agcaatcttg gtcatttgcg ggatcgagtg cttcacagct 960

aaccaaatat tcggccgatg atttaacaca ttatcagcgt agatgtacgt acgatttgnt 1020

aattaatcta cgagccttgc tagggcaggt gttctgccag ccaatccaga tcgccctcgt 1080

atgcacgctc acatgatggc agggcagggt tcacatgagc tctaacggtc gattaattaa 1140

tcccggggct cgactataaa tacctcccta atcccatgat caaaaccatc tcaagcagcc 1200

taatcatctc cagctgatca agagctctta attagctagc tagtgattag ctgcgct-tgt 1260

gate 1264

<210> 2

<211> 1215

<212> DNA

<213> Oryza sativa

<220>

<221> misc_feature

<223> PRO0005 - putative beta-amylase

<400> 2

cccgatttag tagaccacat ;ttggcatca aaccaaaata gacc;ctcxcc cagaat-tgt 60

aaatggcttt gnggttcgtg atatcactga acctgctggg tgaataaagt aaaaaaaaaa 120

acccataaat tggccttctg caagatctcg tcgtcttgcc caaactatag ccttcgatct 180 

ttccatcagg accgcatggg gggagagcag gggcaagtat gaaatggagt tcagattcag 240 

attctagaac agtctgaaca tgcgacgacg acgatggcga tgtatctgaa caatctggtc 300 

czctccctct cctcccgggc gggcttccac gcggctgagt ttcaggctcc caatctgcag 360

crcctcccag aacctractc tgattgattg gttcatcgtt tccatggctc caatgaatgc 420

aacgtgttgt tcagattttc tgaatcttgt tctcaatccg gagtasgtgc tgtagcagca 480

gcaatctgtc cctgatctga gaattttaga cactcgtaga ttcgctgatc aatcattccg 540 

tcccttcgag tggtctagat tgagcttaat catcctgcta ctcgaatcaa atcttcagca 600 

agtgagagct agataattca gaagaaatca acatattctt cgcgaaaaaa agaaataacc 660

gatgaaacca cggtaattag gttcttcgaa tcaccgggag agtaggaaaa aacgagctaa 720 

aatcccacat aggaggaaac ggttaaaaac ggccactccg cgtct~cgcc gcgagactag 780 

ctctcgccag tccacgtagc ccaatccaca accgccacgt gctccgacaa tcccgcccgt 840 

ccatcgccgc ggccccggcc tcatctcgac cactcgtttc ctcccttcac accagccacg 900 

tggcactctc tcgagagctc ccgcccgcct atataaactt gttcgcgctc ggctcctcct 960

cctcatcgac ctccacccca cattgaataa ttatttttaa taattttagt tttttttttg 1020 

gcttragata tattcccaat ccccaacctc ccaataatcc gatctctccc agttctgttc 1080 

ggatcaaggc tgtgtcgatc gcaaaaaaga aaaaaaaaac aattt^cttt tggggtggtt 1140 

catctgttga tcacttcttt gtttcccgcg ttttgttggg gattcgattt t-cgggttaag 1200

attttctaca cgacc 1215 
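Each sequence in this listing declares its length under identifier <211> and ends with a running position counter, so a transcribed entry can be checked for consistency. The Python sketch below is one possible way to do this; the function names and the toy rows are hypothetical, and the approach assumes rows of space-separated residue groups followed by a numeric counter, as in the entries shown here.

```python
def join_listing_rows(rows: list[str]) -> str:
    """Concatenate sequence rows, dropping spaces and the trailing position counter."""
    residues = []
    for row in rows:
        tokens = row.split()
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]  # the last token is the running position, not sequence
        residues.append("".join(tokens))
    return "".join(residues)


def non_acgt_positions(sequence: str) -> list[int]:
    """1-based positions of characters that are not a, c, g or t."""
    return [i + 1 for i, ch in enumerate(sequence.lower()) if ch not in "acgt"]


# Toy rows in the same layout (not taken from the listing).
rows = [
    "acgtacgtac ggttaaccgg ttaaccggtt acgtacgtac ggttaaccgg ttaaccggtt 60",
    "attttctaca cgacc 75",
]
sequence = join_listing_rows(rows)
print(len(sequence) == 75)             # True: matches the declared length
print(non_acgt_positions("acgn-t"))    # [4, 5]
```

Positions flagged by the second function mark characters that cannot be nucleotides, which is a quick way to locate transcription damage in a scanned listing.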


<210> 3 

<211> 1038 

<212> DNA 

<213> Oryza sativa 

<220>

<221> misc_feature

<223> PRO0009 - putative cellulose synthase

<400> 3

gccatcgagt ggtgtgccga taccggcgcc tgttctttac agcctcagct agtgttgttg 60 

tccgaggcaa t^tttccgac ctattgtgtt gctttcctct ctgatagctt atggtaaaag 120 

aracaaagat g-tgaggagt ttgtacgcca cttaattttg ctcgtaacat acattgacaa 180 

tcaagaggag ccatggcatt gcgatctgct tacacggcat attcttactg gatggtgtac 240 

actacttacc czttttaatg caagcatcaa tccattgctt ttctcactgc acacctgatt 300 

cgtactgaaa acgtgaaaca taaaaaaaaa acaaaaatct agctgatgtt ggctctcggg 360 

gcctcgagtc tagtttgtcc tagatggcta acctgatatg tgttggtcac gctcacgttt 420 

gaaccgagaa agagtgtgtg tgtgtgtgtg tcggcgtgct gctacaccag agcctccctg 480 

aatcgcaatg cgtgttaacg ccagcatcgc aggatttcat ctcacttgac aggttcagat 540 

ggccttcctc craccgtctg ccatttatac acgcagtgac ttaacgctta cacgagccgg 600 

atggcccgga tctcccccct gcaccatctc accagaaaaa cggtgaggcg tcaccgcaac 660

ccacccacca aacacatcca cgtcccttca ccgttggcct tcgattttgc ttcagctgca 720 

ctacgacccc tccaacacat ttccctcgcg tcrcgttgcg atctcacctt acgacgatct 780 

cgttccagca gcagcagcat cggcagcggc ggcttgcttc cgaagcgagc aatgcatggc 840 

gcgcgcggcc gcgtgcgtgc gtgcctrggc ttgcgctcta atcaaaccgg gacgccccaa 900 

cccacggtLg gtgcgggacg ccaccccgcc accttaccgc ccccgcctcc ctgcatctga 960

tcatcaacca gctgctatat cacctagcta gccgccgcct cctcctcgcc caccaacgtc 1020 

gcttccccgg cacctcac 1038 

<210> 4 
<211> 1301

<212> DNA

<213> Oryza sativa

<220>

<221> misc_feature

<223> PRO0058 - proteinase inhibitor Rgpi9

<400> 4

tctcttctga agctgaagcc ctgcgaaata ggcctttaaa cgctttaagg ttactggatg 60 

atcatatcgg cgtaagaccg gtttaaacat ggtrtcgctt tgtgaatcca atgtgagtca 120 

cgacgtgaca catggcacgt ccttggagct ttagacatat cgaatctgag cactggagtg 180 

gccgagtggg tgagcggcca aatccgtttt agacagatcg cactgacacg atgttgatca 240

ttgatactaa taccatttta tcaagcagta gtg~tgaaaa aaaaaettat gttctcrtca 300 

actgtgagat ttcatcccgt ttcaagatga acaagccatg catgtgagat gtgaacagaa 360 

gycagaayac agtgyaaaga caggacaaat aagtgaagag ggatcaaatc aatgygcctg 420 

acggtttctg aaagttgaca tggaaatcgc cggtgatcac cggtttatac gttatttaaa 480 

tctgcgattt ccacttLcgt ttgctttcgg ggtrccaatt tgagtcacgc acatattctt 540

catcgtgctt tggatctcag caccgtagta actrttggac aaattgcatt cgccgacact 600 

aataacatgt tctttttatg ctgstttaca tatactgctt atccacaccc aatcccatgt 660 

tcat&tatta tgagatggag ggagtaaact ttg-taacag caacattttt tatattaaag 720

catcaactaa ttaaagcaca agatacgcat gttatctcaa taaatcttcc agtgcatgta 780

taaagaagat gtcgccgcta acttagataa ttt-tgtgac ttttatcctg gccggcataa 840

ttaattcttc cggaaattaa aagctagttt ttccatattc atcagtacag acaagacagc 900

atagtaagcg aagcatacct gacgtgttag ctcattgtaa ctcgatctgg aacactcgat 960 

gctagataca gacagacact cctcgtgatg aacgttagca tttagcaaca tacggtgata 1020 

aagcagctgg ggatcgatcc a~ ccatccat cgtctttaca cgtacttacc ttgctaaccg 1080 

cactgtcgac tcttgcatgt ttgcatgtaa tccaaatgga ccccacgtgg aacatgctca 1140 

cagtgctttg cagctgcttt ccaaaatgct ttctttcact tcttccattc ctctgtccac 1200

aaaaaaagta gtgtgttctt gagcctatat aagagagggt cacacgctcc agtcgactca 1260 

ccatcgatcc atctgacggt tagttccaag ggaaagaaga a 1301 

<210> 5 
<211> 1243

<212> DNA

<213> Oryza sativa

<220>

<221> misc_feature

<223> PRO0061 - beta-expansin EXPB9

<400> 5

aaaaccaccg agggacctga tctgcaccgg ttt-gatagt tgagggaccc gttgtgtctg 60 

gttttccgat cgagggacga aaatcggatt cgg-gtaaag ttaagggacc tcagatgaac 120 

ttattccgga gcatgattgg gaagggagga cataaggccc atgtcgcatg tgtttggacg 180

gtccagatct ccagatcact cagcaggatc ggccgcgttc gcgtagcacc cgcggtttga 240 

ttcggcttcc cgcaaggcgg cggccggtgg ccgrgccgcc gtagcttccg ccggaagcga 300 

gcacgccgcc gccgccgacc cggctctgcg tttgcaccgc cttgcacgcg atacatcggg 360 

atagatagct actactctct ccgtttcaca atgtaaatca ttctactatt ttccacattc 420 

atattgatgt taatgaatat agacatatat atctatttag attcattaac atcaatatga 480

atgtaggaaa tgctagaatg acttacattg tgaattgtga aatggacgaa gtacctacga 540

tggatggatg caggatcatg aaagaattaa tgcaagatcg tatctgccgc atgcaaaatc 600 

ttactaattg cgctgcatat atgcatgaca gcctgcatgc gggcgtgtaa gcgtgttcat 660 

ccattaggaa gtaaccttgt cattacttat accagtacta catactatat agtattgatt 720 

tcatgagcaa atctacaaaa ctggaaagca ataaggaata cgggactgga aaagactcaa 780 

cattaatcac caaatatttc gccttctcca gcagaatara tatctctcca tcttgatcac 840

tgtacacact gacagtgtac gcataaacgc agcagccagc ttaactgtcg tctcaccgtc 900 

gcacactggc ctt scat etc aggstagctt tctcagccac ccatcgtaca tgtcaactcg 960 

gcgcgcgcac aggcacaaat caegtacaaa acgcatgacc aaatcaaaac caceggagaa 1020 

gaatcgctcc cgcgcgcggc ggcggcgcgc aegzacgaat gcacgcacgc: acgcccaacc 1080 

ccacgacacg atcgcgcgcg aegoeggega caccggccat ccacccgcgc cctcacctcg 1140

ccgactataa ataegtagge atetgettga tctzgtcatc catctcacca ccaaaaaaaa 1200 

aggaaaaaaa aacaaaacac accaagccaa ataaaagega caa 1243 

<210> 6 
<211> 1019
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0063 - structural protein



<400> 6

cctagctata tgcagaggtt gacaggttgt ctcttagatc gattaataat arcacattga 60
tgcaattaat tatctgagat caataaagtt tttctttatg ttaaattaat atcagtaata 120
gatgctaagt ccttcattag tagtatccca catttaatca cagttggaca cacaaaaaaa 180
aaggcaatgc cattaatatg ccatctctct tgttttccat tgcctaccaa gtgccatatg 240
atatcatcat caggcaeacc aatccataac tagttcatta gagcaagttt aataatagag 300
ctaactataa gcttataatt tatattggag taaacatgta tagtaaatga gctataaggt 360
tatttcttrt tttctcctcc tctctctatc tcttacctat atatttaatg tatttgtctt 420
gaagtatgtg aatagctagc tcttgtatga gagccaatcc tctgcatttt ttaaattctc 480
tttcctccac ataagcatat agttggctta tagcctgcta ttatacttgg tcttagtaca 540
ctaacccccc ttacatgcaa tgcaagctgt craattaaaa gggtttcaca acattttgaa 600
tgccactact agctcccaac cacaaccaca gatctagcta gggtttgttc atttctctcc 660
tctctcctcc tcctcctttc cgttgtgcca attcatccaa agtcattgag agccatacta 720
ctccatatca tattactcct acatgtgtac tacatttata ttgatgatct gtaagagcaa 780
aagtattaat ggggatcaca ggattgcagt aacagcagca ggtaccccct cctttaacat 840
ccgcagttac gcctcccacc taccgtcttc tctgccgatc gatgacgacg agcttctcct 900
ccqctataaa tcctctcccc tcctctctcc ctcctcctcc aactccacat cgatcaqcag 960
cagcagcagc ttgcacactc gagcttagct tagcttttgc aagagagazc gagctagag 1019



<210> 7
<211> 1212
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0081 - putative caffeoyl-CoA 3-O-methyltransferase

<400> 7

atggtgccat gtcaataaga catcataata gaaactacac tccacaaccc atagtttctt 60
aaagtgggtc attaataaat acatcatcta tcttttctat caatcatatt tattctttat 120
ctattatgac ggcactattt tctcccaatg taaaac-tga taatgtctag tgcataggtt 180
ctcgtgttga agctgtttct tacatgagac ccagttrctt cttctctcca ctctctctta 240
attaatataa tgtcacataa gttaaaagtt ctagtaaata ataatatagt taatgacata 300
gacaacatcc tagatgtagg gttaggagtc ttcggacagt agcaaccctg ttttgactcc 360
ttttttggct gcccanccac agtcgccacc agaaaattca ctgtgcccaa atcaatggaa 420
gcgcctacta gatccatcca tcttcgtgac agctccgagc tttctcctgg ttatttttct 480
cccaaaaata cattcagaac acgatctcaa atttaaacta atggagtgct actgcatttc 540
ttaattataa gtcgcagcac cactcattaa tcatttccat cacaggtaaa tcgtggtgag 600
ctggtggttg ctactgtact actagtacta cctgtcgcag ctttgtagaa gccgttttcg 660
ctgaagcttc ttcttcttcc ctgggcaaaa taattttaag caggcggaat aatattggga 720
taaacagggt ggacaaaagc grgcgatccc tttctttaac caaaccacga cgaaagcagg 780
ttaggtcgcg gcaggtggtg gtggtaggaa gaagaagaaa gagaggggaa aaaaaacaaa 840
aat~tcacat gcatcatgca tgaagtagta catgtagtac tgagtactgt aataatgttc 900
agtttactgg accgtctcaa cgggaagacc aaattaacgc ttataaaata cccttttttt 960
gggcactgat catggccact acgtttggtg gctcaacaac caggtcaccg tgcgatcgat 1020
cga~tgctaa tttatttttt gaaaaggaag ggaggaaaaa agaccgggtc tttggtggcg 1080
ccaccaaccc tgctctcgtg agccgataaa tattgctcgc cggagctctc ggttgacgac 1140
ccaaccaatc gactcgcacc accaccagca gctcaagcag caacagctca aacggaggaa 1200
gatctcatcg c 1212

<210> 8
<211> 1052



<212> DNA 

<213> Oryza sativa 

<220> 

<221> misc_feature

<223> PRO0091 - prolamine RP5 



<400> 8

gtttttctat gaaccggtca ttaaaccgtc cccggttaga ccgaacaagc cacaataatc 60
ttgaaatggg ccttgatgtg gcccaattgg tctgcctaga gcgttttggt tggcaaaaat 120
caatctccta ttctcggcac gtgtgatata caatggtaag tgagatatac aattctcggc 180
acggctacat tacaaggtgt cgcattgtgt caatgtttgg ttaatttgct agattcacat 240
aatacatgcc aggaagttca gaacaatgtg ttgcctttca ccggaaaact ttgttggagc 300
aaatgccttc ttcttttttg cttctgcttc ttgagtccat gtggaggaag cagtagatag 360
ctgatgatat caggattcct tctgtgtctg tgtaggtgta gcaacaccac tazaattttt 420
atttagcaac acaatatcaa tttggtctat aaaagtatga attaaatcaa tccccaacca 480
caattagagt aagttggtga gttattgtaa agctctgcaa agttaattta aaagttattg 540
cattaactta tttcgtatca caaacaagtt ttcacaagag tattaatgga acaatgaaaa 600
ccattgaaca tactataatt ttttttctta ctgaaattat ataattcaaa gagcataaac 660
ccacacagtc gtaaagttcc acgtgtagtg cattatcaaa ataatagctt acaaaacata 720
acaaacttag tttcaaaagt tgcaatcctt atcacattga cacataaagt gagcgatgag 780
tcatgtcatt atttttttgc tcaccratcat gtatatatga tgggcataaa agttactttg 840
atgatgatat caaagaasat ttttaggtgc acctaacaga atatccaaat aatatgactc 900
acttagatcc taatatagca tcaagcaaaa ctaacactct aaagcaaccg atagggaaac 960
atctataaat agacaagcat aatgaaaacc ctcctcatcc ttcacacaat tcaaacatta 1020
tagttgaagc atagtagtag aatcctacaa aa 1052



<210> 9 

<211> 1216 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> misc_feature 

<223> PRO0095 - putative methionine aminopeptidase 



<400> 9

cctgatggat gatgaatcac tgatcgattt ctagttctta ttctctgaag atgaaccgaa 60
gatccaagat tggtccatga aattatcctt tcttgatttg gccctccgag aatagattcc 120
tgtgcaatct agtcagtagt tgttcaggtc atgtaaacgt acggtaagaa atttatg-gc 180
agagggtttt ccagtttatc ctatgcattt gacctctggt catgtattga ttctgagaca 240
aagtgtagtg atcgcttgat gatactagta cacattgctg ccttcttttt tgtcctgtaa 300
aagatttatt attggcagca atggatggta gagagggcaa tctgcttctt agttttgagt 360
ataaagtttt aagttttgag cagagtttcg aaaatttgca gtagaaagtt tgaaatttca 420
aattggaagt acagtttttc aaatttccag tataaatttt taaacccact gagaaaccaa 480
gagcatatgg gcgatcaaaa atttcttttc taaaggaaaa atatttttta aaaaacactt 540
agtagtatat caaaattctg aggtaagctc attaggccca ttcactgtac ggcccatgaa 600
gcccagtctg gtgagatggg cctacccgtg caggcagaga tggatgggcc tttaattgta 660
ggcccatgtt ggaaagccca ccaaagccca ataatatatc ctcctcacct tcaaccctaa 720
tcctcctctt cttctagaag actgaaaatt cctctccttt cttctctcgc cctcaccgct 780
cgccgaggtt gccgtctccr tgtctcctcc gctccttgcg ccgccgccgc gacgagtcgc 840
ggggaggggc ggcgatctcc atctccatct gaggcgagga gagcagggga ggtgagggga 900
tcctggtgag gtgagcatcc acgtcctctt tctttttttc tgattcatct ctctctctct 960
cgcacatcgg gactggaatt tgcttgcgtt cgttcgttaa gttaacccta gcttctcttc 1020
tagatctgga agaaactctt cttcttttaa tttcagagcc ttaaccttaa ragtacaagt 1080
aacagtttgt ttgttccccg aaaagtttgg atgccttcca aatagagaca catgtta-tt 1140
attttggaat gtaatttgtc cctggattta ttcattcagg tttgtgat^a ctggacaata 1200
gaaatattta cacaat 1216
<210> 10

<211> 1237 

<212> DNA 

<213> Oryza sativa 



<220> 

<221> misc_feature

<223> PRO0111 - uclacyanin 3-like protein



<400> 10 

tcgttaagtt tgatgatttc tgatgaccca tggtcaccta gccgctagca gtaccatgca 60 

tgatcaccet ccacaaagaa atggtacagt acatctccgt cccaaaataa gtgcagccat 120 

gtatatccat gcctaacgzt tgaccgtccg tcttatttaa aaaaattatg aaaaatttaa 180 

aaatatttag tcacacataa agtattatzc atgtttuar.c atctaatagc aacaaaaaat 240

actaatcata aaattrtttt taataagata aacggttaaa cgttgaacgt gaatagtgca 300

aaacttattt tagaacggag ggagtacgaa gtaactccgg aactacatat agggcaatta 360

ttgccctacg tatgcatata gtcaatcaat taactgctga caatggaaaa gctaatcaat 420

caatcaatgg tttgattaat caaattaagc caggtcagtc cgtcagtgza cattcactaa 480

ttaaattaac aggtttgttc aacggttcaa ccaacatctg ccatcaacat cttttcgttg 540

cacctttcrt gactctttat gctattttgc taaaaaaaaa cttctcttta catcacttat 600 

aacaatatat atttctgctt taatttgtaa tctttttttt ctgcgttgca acggaaatca 660 

cgagcgatat atggtgaaga ctgatgataa tcgtatttct gatgacccat gattccgcgg 720

tgtaccatct gttctgtcaa ctaaaaag-g gagtagttcc ttgacggaag aagggagcaa 780 

aatagaagat attctcagtt gatctgcagt tgttgttagg tcactatatt cagaaatcgc 840 

agttgctgtt gtttaaattg tgtgtgacag cagacagcta attatcagta cacgtatatg 900 

agc;aatacta gtgaatctgt a^taatttaa cgagagtatt ttctatatac aaatacaaca 960 

gcaaaactgt gccactggcg ccgaatacgt acggacagag ctcaggcaat caggggagca 1020 

gcaaaagagg agagagttgg tgccaagcac aactaaaccc aactgcaccc aaaaactaat 1080 

cagcatttca gttcgcttta gttagtacfea ccacctgcat cxctttacca acactatata 1140 

acccgcagtg gacctgcagt catctcacta attcagtgaa gccaccagta ctagtacggc 1200 

tctaatcagt tcgcgtttgc taattaactc tgccatc 1237 



<210> 11 

<211> 1100 

<212> DNA 

<213> Oryza sativa 



<220> 

<221> misc_feature

<223> PRO0116 - 26S proteasome regulatory particle non-ATPase subunit 1 



<400> 11

ctaagggcag cagccattgg gctctatagg tgtggttgca agtgcact^a caagcgagca 60
acctggtaga atatccccga gatcagtagt taccgtgatt ggttcagact tgagaggcta 120
attttttcgt acctgtagct ttattacatc gcatttcctc ttattgaagt ttagccgagg 180
tggtgcggat ggatattcag tctaacagac tcaatgaacg ctttgttg-a tgacttgtac 240
agtactggct gctcgaacag gatggttcag cttccagaaa tttggcaacg ctccatttca 300
aagaaaatca ttcagtattt gocttcttgt tgttacattg atctcatata aagtcacttt 360
gatcgttgac atcttgtttt ttggttcgtt tgccatggta grttcccttg ctgctgggag 420
gattgccgcc tgaacttttt cttttttgcg aggatgttat ttttgccaga caagaacggg 480
aataagcaaa ttgtttggtg gaactaaagt aaactcgatc tctttccgag aagtgtatta 540
rtttcacgtg taccatcaat ttttttgaaa gtaaatattt ttccccttta actaatgttc 600
actttggacc ggataatctt acctttattt aactttgggc tatctaactc tcttctaaag 660
catataaacg atcttgagta catcgattcc tacttatcat ttaactctcg tagcttaatg 720
taagattatt tctttgaaat atgataaazt ggatgcatat gaatgaaaga gtcaaggatt 780
aagtgattcc tcaaaaaaaa aaaagagtga aatttattta tttttcccct ttcgacacga 840
agaagggctt ggttggagga aaatggccca gattcagatg accgaggccg agtaccatgg 900
ggcccacaag aataataagc cccgagccca aacgctaagg cccacgagaa gccgtgcgct 960
ggaagaaaga aagaaacsgc ggccgtcttc acaccgaagc ggcggacgag acgactcgca 1020
gtcgcagcct ctttcctcct ccgtctctct ctcccctctt ;ctctcctcc gcgcggcgaa 1080
cgaagcgagc gagcggcggc 1100



<210> 12
<211> 1216
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0117 - putative 40S ribosomal protein



<400> 12

cgtgttcatg ttcgcattta ggattggact tttttaggat ggagaggara tgtcctaacg 60
gaaatgtcat gtctatgctc cgatcttata aatttgttca atagcgrtgc aaacgcgatc 120
attaaaaagg cggtaagaga actaccacat tttcgaaagc ccattctctt cgtgagttac 180
tggaattatt tggcatagca catgcataaa gatgctttag taatgagctc aataaaacac 240
gacagctttg catgtagcca caatgctata gtaaatgagt tgracttctt ttgcattgca 300
aagtggtact gaccttgttt aggcagctag cttcattcat tttttgaatt ctatagttat 360
agttataaag attatcataa tttagataag aatccggtat gtttgagaag ctggagtttc 420
tagagaagct ataacaactc gaagctccct aaacagagcc attgaacatt gagctgtcca 480
gtatatcatg acaaaatgat acattttgca tgggcatatg tgtctaagaa aacaaacatc 540
acaattcaat gagtcactct aaaaaaaaag gcaaaacact caacaaaacc ataccgtgaa 600
agtgaaccta taatgaaatg aaattttgat aagcatgctt acccaggtgg aaatttcaat 660
ctaagaacaa tttccaaaac caccgtccat agaaatatgt ggaattcatt cagaattttc 720
ataccacacg ataaaattta tagggaattt aacttttgcc atttttaccg aacaccacct 780
tttcatttgc tcctataatg ttatcgaaaa gagagtgttt gttaattatt tgtcactttt 840
atcacgacat gtagccgtga caacgtggcg ttcctcgtgg agcccacccg tcagccgccg 900
tacgcaccac catcaaagaa ttcsagacgg agagcgtcgt cgccgtcggc aaggcggcgt 960
gttttgttca ctgtacgttg cttcggcgtg ggcccaatct tgttcgggcc taactag-tc 1020
ttcccagccc aggcccatta agcctaccaa cccggacggc ccgggaggag ctagggtttc 1080
acccttcact atataaacct ctctctcctc ctccggccgc cgcctccgaa gccctagctc 1140
ctcccgccgc cgccgccgcc gccgccgccg cctctccact cgagagaccc agccgccgcc 1200
gccgccgccg ccgcca 1216



<210> 13

<211> 1210

<212> DNA

<213> Oryza sativa

<220>

<221> misc_feature

<223> PRO0122 - chlorophyll a/b-binding protein precursor (Cab27)



<400> 13

cagatgccac agtatggtgt accaccagct gctccacacc atgctccacc ggctggccaa 60
ccaatgtatt tcccgaaata atctatcttt atccgatgta caagcaatna gagcaat~gc 120
aaatgttgcc tgcaatactc gggtctgggt atcttctctt caaattttgg gttgtaactc 180
gtctatgcag ctattcatat tgtaactcag tgagctccct gtcgcaaatg tgcctctgcg 240
tcagtcgctg tctgtaaact gtccggcaat tagaaattcc catccttagc atgcctggta 300
ttgttcagct cgaaactgaa atttttcttc gtgccctata ttttttcggt gtagataagt 360
gttccgctgg aattttatgc aggtgctgta ccctatgtgc tgcttttttt ttgtgtgggg 420
cgcccccccg gggggggggg ggggtttcct ggcatgattg caaataagaa ccccggggca 480
aatctgctgg ttggttgcaa ataataaccc ctccaaatct gcgcagatga aaccccatrc 540
aggacatgaa ttacgattgt tcatgagcta tttggatcat ggaaagattg gaaacaaaca 600
cttacgtcaa ggtttctact aattacgtga ttccgatttc agagtcagcc atggctatac 660
tgcctttgct ccagtaaaca tcgctgctct agtaacaaac attgcagtaa acatcacaac 720
tatccaattc ccttgttgcc gctctagtaa aaaacattgc aattatccaa ttcccagata 780
ttttctttca ctactccaaa acctaaagta catatacgrg agttgagtga uccagcaaca 840
taaaaatccg aggctccgag cgatctgcac caaccatcLc acccgtccga cgtggcagca 900
gcaaccagcc acagctgaga cctccatcca atagaaaccc tccctttgat tcccccgtat 960
cccggcatcc ggataacgct ggataagagg cgacgcctcc cattggccac acccacccaa 1020
caacgcatcc tggccgtccg atccaccccc accgccgatc tccgccgtcc gtcgccgccc 1080
tcgccaccgt ggccacctgg cagcgccggc cactcccgga cagtttaata caagccacgc 1140
ctttgctccg tgccggccaa aacgtaccct tgtgactaca cccgcttcgc ttcctcccct 1200
ctctaagccg 1210

<210> 14
<211> 1179
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0123 - putative protochlorophyllide reductase



<400> 14

ttgcagttgt gaccaagtaa gctgagcatg cccttaactt cacctagaaa aaagtatact 60
tggcttaact gctagtaaga catttcagaa ctgagactgg tgtacgcatt tcatgcaagc 120
cattaccact ttacctgaca ttttggacag agattagaaa tagtttcgta ctacctgcaa 180
gttgcaactt gaaaagtgaa atttgttcct tgctaatata ttggcgtgta attcttttat 240
gcgttagcgt aaaaagttga aatttgggtc aagttactgg tcagattaac cagtaactgg 300
ttaaagttga aagatggtct tttagtaarg gagggagtac tacactatcc tcagctgatt 360
taaatcttat tccgtcggtg gtgatttcgt caatctccca acttagtttt tcaatatatt 420
cataggatag agtgtgcata tgtgtgttta tagggatgay tctacgcgcc ttatgaacac 480
ctacttttgt actgtatttg tcaatgaaaa gaaaatctta ccaatgctgc gatgctgaca 540
ccaagaagag gcgatgaaaa gtgcaacgga tatcgtgcca cgtcggttgc caagtcagca 600
cagacccaat gggcctttcc tacgtgtctc ggccacagcc agtcgtttac cgcacgttca 660
catgggcacg aactcgcgtc atcttcccac gcaaaacgac agatctgccc tatctggtcc 720
cacccatcag tggcccacac ctcccatgct gcattatttg cgactcccat cccgtcctcc 780
acgcccaaac accgcacacg ggtcgcgata gccacgaccc aatcacacaa cgccacgtca 840
ccatatgtta cgggcagcca tgcgcagaag atcccgcgac gtcgctgtcc cccgtgtcgg 900
ttacgaaaaa atatcccacc acgtgtcgct ttcacaggac aatatctcga aggaaaaaaa 960
tcgtagcgga aaatccgagg cacgagctgc gattggctgg gaggcgtcca gcgtggtggg 1020
gggcccaccc ccttatcctt agcccgtggc gctcctcgct cctcgggtcc gtgtataaat 1080
accctccgga actcactcct gctggtcacc aacacgaagc aaaaggacac cacaaacata 1140
gtacacttga gctcactcca aactcaaaca ctcacacca 1179

<210> 15
<211> 1808
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0133 - chitinase Cht-3

<400> 15

tttggcgcgg ggcagaagag tggactttaa ctttcttttt aataaaatct ccaattaata 60
tgtaattata atatactttt aatcaaaaca tgcaaagcta gcagtattta catcactaga 120
agtaaatctt tcttgctcat gatgcttcag ccggacggaa ccctaaaata tagatggggc 180
ggatacactc gattaaaaca gctaattgca acacatatca tataaggttt tggaattcat 240
accaaatgct ccgaaattcg tctatttcga tgaggcccaa gacatgacct cctgtttcgc 300
ccatagttta tggtgtttgg taaaatttgg traaaatctg tctattttag taggtcccga 360
aattcttazg caattgaatc ctagaaccct atcatattra tattgcaatt gcacaaaaat 420
aatgtgcaat caatatattc caattgcaat acatatcaag catgaggtgt aatacatatc 480
cagccgctag cactgggtct gttgaggtgc ttcttgcagc aacagctgca atctgtttgg 540
ctaggctgtt ggcgccaggc actgctgtcg tgctgcaaca atggcacart cgtcgagcac 600
acaaecgcgc ctatgcacag cgcaagctcg ctgccttgga ccgtggttcc agtgttgcat 660
caaggcttag tggattgagc gagaagacga actgacaatg ccaaagatgc gatgctgcga 720
gtgtggactg cggaagatga atcgagatca atcaattcgt tatgcttgaa aggctggaat 780
aactgatcag ttggctggat cgatggtatg tactagataa tatgcggtct aggcctagac 840
caagaagcag aagaggagtc gggtcgggag tgtggggcga cgtaggctgt agctgggccg 900
gccgccccag gccgcctaat gagtgtgtcc gcccctggcc tgacacgatg gg~aattaaa 960
tagttatgca tgtccctctt tgtctaaaca atatgtataa aattgacgat atcttgggca 1020
aaatcactgg gcatggcaca caggagagct actttagcga catgaatc~a ggcgaaaatc 1080
tattgaacca aaaatcgact gtaatctcat gaaaattttc gtcataatta tagcaaaatc 1140
gttgttggat tgattgcacg agaaaacaga agaagggagc taggtgatat tatattgttt 1200
tgttgcctac ataaatctta aagcaatcga atggtctaaa atttacaaga tttttaaaga 1260
ggttttcgta ccgtatagac cccggccggg tcaaacttat ttggtcgtcg ctggttgttt 1320
gtagcacgcc agctccatat atgtggattg cagctggtct atgataagtt cggtcgazct 1380
gagatcaatc tatcaatcgt caaccctttg cctttgttag cgagctagcg tgtacacatt 1440
tcaattatat atggtgcatg catggcatcc acgcctccac ggtcaacgtg gaaatatctc 1500
tggaaactta ctttttctaa ataactgaac ggattggagg caggagacaa attitgaccaa 1560
cacaatatat ccacgacggc tagacaatac tagtagatgc atgcatggaa ggatatagta 1620
gtacttgtta atcgtggaaa ctttggtaat gcgaatgcat ttcaattcgt tgctgaagat 1680
cgatgcacca tgcatatcca tctctatata aagccatgcg atcccaccga ttcttgcaca 1740
cacactagct acttctactt ctatcatacc aaacaaacta gcttaatttg cattgcatca 1800
cattgccg 1808



<210> 16 

<211> 1828

<212> DNA

<213> Oryza sativa

<220>

<221> misc_feature
<223> PRO0151 WSI18



<400> 16

gcttgagtca tagggagaaa acaaatcgat catatttgac tcttttccct ccatctc-ct 60
taccggcaaa aaaagtagta ctggtttata tgtaaagtaa gattctttaa ttatgtgaga 120
tccggcttaa tgcttttctt ttgtcacata tactgcattg caacaattgc catatattca 180
cttctgccat cccattatat agcaactcaa gaatggattg atatatcccc tattactaat 240
ctagacratgt taaggctgag ttgggcagtc catcttccca acccaccacc ttcgtttttc 300
gcgcacatac ttttcaaact actaaatggt gtgtttttta aaaatatttt caatacaaaa 360
gttgctttaa aaaattatat tgatccattt ttttaaaaaa aatagctaat acttaattaa 420
tcacgtgcta aaagaccgct ccgttttgcg tgcaggaggg ataggttcac atcctgcatt 480
accgaacaca gcctaaatct tgttgtctag attcgtagta ctggatatat taaatcatgt 540
tctaagttac tatatactga gatgaataga ataagtaaaa ttagacccac cttaagtctt 600
gatgaagtta ctactagctg cgtttgggag gacttcccaa aaaaaaaagt atragccatt 660
agcacgtgat taattaagta ctagtttaaa aaacttaaaa aataaattaa tatgattctc 720
ttaagtaact ctcctataga aaacttttac aaaattacac cgtttaatag tttggaaaat 780
atgtcagtaa aaaataagag agtagaagtt atgaaagtta gaaaaagaat tgttttagta 840
gtatacagtt ataaactatt ccctctgttc taaaacataa gggattatgg atggattcga 900
catgtaccag taccatgaat cgaatccaga caagtttttt atgcatattt attctacLat 960
aatatatcac atctgctcta aatatcttat atttcgaggt ggagactgtc gctatgtttt 1020
tctgcccgtt gctaagcaca cgccaccccc gatgcgggga cgcctctggc cttcttgcca 1080
cgataattga atggaacttc cacattcaga ttcgataggt gaccgtcgac tccaagtgct 1140
ttgcacaaaa caactccggc ctcccggcca ccagtcacac gactcacggc accaccaccc 1200
ctgactccct gaggcggacc tgccactgtt ctgcatgcga agctatctaa aattctgaag 1260
caaagaaagc acagcacatg ctccgggaca cgcgccaccc ggcggaaaag ggctcggtgt 1320
ggcgatctca cagccgcata tcgcatttca caagccgccc atctccaccg gcttcacgag 1380
gctcatcgcg gcacrgaccgc gcacggaacg cacgcggccg acccgcgcgc ctcgatgcgc 1440
gagcccatcc gccgcgtcct ccctttgcct ttgccgctat cctctcggtc gtatcccgtt 1500
tctctgtctt ttgctccccg gcgcgcgcca gttcggagta ccagcgaaac ccggacacct 1560
ggtacacctc cgccggccac aacgcgtgtc ccccctacgt ggccgcgcag cacatgccca 1620



EP 1 801 223 A2 






tgcgcgacac gtgcacctcc tcatccaaac tctcaagtct caacggtcct ataaatgcac 1680 

ggatagcctc aagctgctcg tcacaaggca agaggcaaga ggcaagagca tccgtattaa 1740

ccagcctttt gagacttgag agtgtgtgtg actcgatcca gcgtagtttc agtrcgtgcg 1800

ttggtgagtg attccagcca agtttgcg 1828

<210> 17 
<211> 1267 
<212> DNA 
<213> Oryza sativa 

<220> 

<221> misc_feature 
<223> PRO0169 - aquaporine 

<400> 17 

cgtcctcctt ttgtaacggc tcgcaaatac aatgggttgt ttagattcat gtcattttaa 60 

atcatattat tttttataaa gttatcaaaa tgtacataca tttatttatt tttaccaaac 120 

tttactaaat gagataatcc aacaaatggc atttaaagcg ttcaaatcca agaaatgcca 180 

tcgccgttat gcttccgtcc gtttcacgcc gttaaaatac aatgttcatc ctataacact 240

taatggtgtg gaatggacgg aaccctaacg gcgatggcat ttttgggata aagtcgtttg 300

tacgatggca tttcttagaa ctcatatttg tcgatggcat tttttgaatt tggatgattg 360

tcaatggtat tttttggatt atctcttagt aaatacataa ggaatcatgc caaaacttga 420

caatattgnc aacttatcaa aatttaattg ggattatttt ggcgataata tgaacagccc 480 

ttacatttct gaagaattat agctcaaata tggctatggc cctgtttgga ttcggagggc 540 

tatttaatag ccctccggaa tcttgctatt taagagtatt aaacgtagat tactgataaa 600 

ac;tc;attcca taacccctac gctattctac gaga'jyaatc taacgaggta tattaateca 660

tgatttgcta cagtaatcag ccgctaatcg tggattaata tacatcatta gattcgtctc 720 

gtaaaatagg ctagggatta tggaatcggt tttatcggta atctatgttt aatacttcta 780 

aatagcaaga ttccgaaggg ctatttaata gctcggagca tccaaacaag gcctatgttt 840 

agatccaaac ttccaactrt ttctatcaca ttaaactg~c atacatacat aacttttcag 900 

tcacatcgta ccaatttcaa cccaaactzt caactttgga agaactaaac acagcatatg 960 

acagtgcagt tcagctcaat tttgttcgga gcctaaaaaa aagaaaagaa aaaaagctca 1020 

atttggataa ggctatgaat aaactcaaaa aagcatccaa cctaaccacc acactggccc 1080 

accagggccc acgctccact cccgtgatca tcacctcctt ccctttccag aaccaccttc 1140 

tccttccttc ctcctcttct tcttcagtgt actctgcctt tataacaccc tactcctctc 1200 

tctcacctcc accatctagc tcactcacac agtctccact cacacgcatt gcagaggaga 1260

ggcgaca 1267

<210> 18 
<211> 1130 
<212> DNA 
<213> Oryza sativa

<220> 

<221> misc_feature 

<223> PRO0170 - High mobility group protein
<400> 18

catgcggcta atgtagatgc tcactgcgct agtagtaagg tactccagta cattatggaa 60 

tatacaaagc tgtaaractc gtatcagcaa gagagaggca cacaagttgt agcagtagca 120 

caggattaga aaaacgggac gacaaatagt aatggaaaaa caaaaaaaaa caaggaaaca 180 

catggcaata taaatggaga aatcacaaga ggaacagaat ccgggcaata cgctgcgaaa 240

gtactcgtac gtaaaaaaaa gaggcgcatt catgtgtgga cagcgtgcag cagaagcagg 300

gatttgaaac cactcaaatc caccactgca aaccttcaaa cgaggccatg gtttgaagca 360

tagaaagcac aggtaagaag cacaacgccc tcgctctcca ccctcccacc caatcgcgac 420

gcacctcgcig gatcggtgac gtggcctcgc cccccaaaaa tatcccgcgg cgtgaagctg 480

acaccccggg cccacccacc tgtcacgttg gcacatgttg gttatggtrc ccggccgcac 540

caaaatatca acgcggcgcg gcccaaaazt tccaaaatcc cgcccaagcc cctggcgcgt 600

gccgctcttc cacccagg-c octctcgtaa tccataatgg cgtgtgtacc ctcggctggt 660



tgtacgtggg cgggttaccc tgggggtgtg ggtggatgac gggtgggccc ggaggaggtc 720
cggccccgcg cgtcatcgcg gggcggggtg tagcgggtgc gaaaaggagg cgatcggtac 780
gaaaattcaa attaggaggt ggggggcggg gcccttggag aataagcgga atsgcagata 840
tgcccctgac ttggcttggc tcctcttctt cttatccctt gtcctcgcaa ccccgcttcc 900
ttctctcctc tcctcttctc ttctcttctc tggtggtgtg ggtgtgtccc tguctcccct 960
ctccttcctc ctctcctttc ccctcctctc ttcccccctc tcacaagaga gagagcgcca 1020
gactctcccc aggtgaggtg agaccagtct ttttgctcga ttcgacgcgc ctttcacgcc 1080
gcctcgcgcg gatctgaccg cttccctcgc ccttctcgca ggattcagcc 1130

<210> 19
<211> 1230
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0171 - reversibly glycosylated protein RGP1



<400> 19

tagtaccatt cttccctcgt gagcataaat gtattcatac aaaatagtaa aatgtatcct 60
cacaaagatt gtaagtatat ctcgcaacta taaatatgtt gtcattttag taacaattgt 120
tcataaaata gtaatcatgt tctccataac agtaaatgac gaggcgttaa tagtggttta 180
ggttctcatg attgtaaatg ttgagtcgct tgtagcggct taagatatag tagagagtat 240
atctagtttt atcaagacaa acattgcgta atgcctcgga cctaatataa aagtaggaat 300
tttaaccttt gagaaactgt aaccaattga aactgcaagc tttaaaaaaa catctattgg 360
aagtgatatt atatagacaa aataagtttc ttactcttac tctctcagtt tcaagttara 420
aaatgttttg gctttggtca aaatcaaact tcttcaagtt taatcaagtt tatagaaaaa 480
atagtaatat ccaagataaa tttattataa aaatatattt aattattatt ttaataaaac 540
taatttggta atgtaaatat tactatattt gtctataaac ttagtcaaat ttaaaacagt 600
ttaactttga ccaaagtcaa aacatcttat aacctgaaat ggatggagza tttgtttgtt 660
tctattttag gaaacggccg tttctttcca ttgattttga gataagcaga gctttaaacc 720
actgccacta ttgtgcattt catttgattt aacactttta ccccttatct ccaataaaaa 780
cgatattaag atacccctat cttttatcca ccgcttggaa caaaccaaaa aaaataaaaa 840
ttcaaacctt ctacactggt acacacgttc tctctttcca tgcaccgaca ggtctctccc 900
agatccaacc caaaataaat ttggacgcat cccaaaattc ggcaaacata tgacgcaaac 960
caaaacaaaa taggcacaaa ataatataat actcctatct aattaattat acacaatttt 1020
ttttaaaaaa aaagcaaggc aagcgaagca aagcaaagaa ggaaacgaat aacaaagncg 1080
tcgtcctccc ggagctcccg ctctataaat cgctcctcct ccccacccac ccaaacccac 1140
acacacctca cacctcacca ccatcacctc ctcctcctcc tcctcttcct ucgcgcgcgc 1200
gagatccagg gagagggaga gggagagatc 1230

<210> 20
<211> 1234
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> PRO0173 - cytosolic MDH



<400> 20

gtttggttgg tgaccgcaat ttgctatacc aaaatcttag acacagttga attaagctac 60
actttattag cacattggcc cgtgcgttat attgtcattt tctagccaaa gtttgccata 120
attgtggcta acaaattgtr ggccacattt tggctacgtt cgataggaca tgttcccaac 180
ttctccttct cgtttttcgc gcgtacgctt tttcaaactg ttaaacggrg tgttttt-gc 240
aaaatatttt tttacgaaag ttgcttaaaa aattatatta atctattttt tttaaaaaaa 300
gtagctaaaa cttaattaat ctcacgctag acgctgcttc gttttacgtg ccgggtaccc 360
aaccctcact cccgaacaca gcctttgtgt ggtttactac agttatagta aagctagtct 420
ccatccaaac aatcctttag tccatataac ttcgtatact ccaaaattcc acrcgttcta 480
cggacatcac taatacgaag atcaagtgga agatagatat ttttaatgac atcttatttt 540

cagtgaacac ttgaggtcct cacgatccac aaacacacat tttcgnagat aagttctgaa 600

atactccata cggcggttgt cacgatgtca tgatcgtcgt tacccaagga agaagaaaag 660

agtggcatct tctccacgcc agtgttccca acggagcatc ttttcttccc ccacacggca 720

tcgacgtcac actttctggt gcaaacttta ataattagtc caaaaacaaa aaaagaattt 780

cggccacatc ttctcccgaa aogccaggtg ggccccacct gcatcactga cagcctgtcc 840

ccacaacgcg cagtcgtgtc cccacctgtc aggatgttag cgtctccgtt gcaggtttcc 900 

cagatcccat cgccgatctg tgggccagcg cccacggtgt cacgcccgcg cacacctggc 960 

tccaacccac ccaccccacg cgctccgtgg ccgacagcgt ggacccaccn agctggggcc 1020 

caccgtcagt gggagatggg taggggagcc cccacgtggg agcaacgggg gttctccggg 1080

ctccccgtcg ccgcgaggtt aaataacggc cacccgttt£ cccctctc-c gcaaaactca 1140 

cccaaaagag cagcgtcgcc tctcctcctc ccccctaacc cctacgctzc cagaaccttc 1200 

tcgaagctcc cgctcccccc ccccttccgc tcca 1234 






<210> 21 

<211> 1553 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> misc_feature

<223> PRO0175 RAB21 

<400> 21 

gtcaccaccg tcatgtacga ggctgcttca ccactgcctc actgccacca gcgtctcccg 60 

ecgcgtgcaa tacaagaaga aacatcgaac ggtcatataa ggtaauaccc actaccgatt 120

taacctatca ttcccacaat ctaatccact tatttctctt cccatgatct tatcctctca 180

tttctcctca ctacttttgc atttgtagga aacacaatga caccgtcgaa gaaagctggt 240 

ggagcaccgt agccagcaat caccaaaaca cagaggggag gaggtcggca gcggccatgc 300 

ggacggcgat gagacaacgc gacgcaaaga gggaggagga cgttggcgat catgctggtg 360 

ttggcggagg aggtcactgg ccatgcgaat gacagcgggg cagcgcaaca caaaaagggg 420 

ggaggatgcc ggcgaccacg ctagtaccat gaagcaagat gatgtgaaag ggaggaccgg 480

acgagggtug gacctctgcc gccgacgtga agagcgtgat gtgtagaagg agargttaga 540 

ccagatgccg acgcaactta gccctgcaag tcacccgact gcatatcgct gcttgccctc 600 

gtcctcatgt acacaatcag cttgcttanc tctccatact tgtcgtttgt ttcccgtggc 660 

cgaaatagaa gaagacagag gtgggttt~g ttggagagtt ttagtggtat tgtaggccta 720 

tttgtaattt tgttgtactt tattgtatta atcaataaag gtgtttcatt ctattttgac 780

tcaatgttga atccattgat ctcttggtgt tgcactcagt atgttagaat attcattccg 840 

ttgaaacaat cttggttaag ggttggaaca tttttatctg trcggtgaaa catccgtaat 900 

attttcgttg aaacaattut tarccgacag caccgtccaa caatttacac caatttggac 960 

grgtgataca tagcagtccc caagtgaaac tgaccaccag ttgaaaggta tacaaagtga 1020 

acttattcat ctaaaagacc gcagagatgg gccgtggccg tggctgcgaa acgacagcgt 1080

tcaggcccat gagccattta ttttttaaaa aaatatttca acaaaaaaga gaacggataa 1140 

aatccatcga aaaaaaaaaa ctttcctacg catcctctcc tatctccacc cacggcgagc 1200 

actcatccaa accgtccatc cacgcgcaca gtacacacac atagttatcg tctctccccc 1260 

cgatgagtca cczacccgtgt cttcgagaaa cgcctcgccc gacaccgtac gtgcgccacc 1320 

gccgcgcctg ccgcctggac acgtccggct cctctcccgc cgcgctggcc accgtccacc 1380 

ggctcccgca cacgtctccc tgtctccctc cacccatgcc gtcgcaatcg agctcatctc 1440 

ctcgcctcct ccggcttara aatggcggcc accaccttca cctgcttgca caccacagca 1500 

agagctaagt gagctagcca ctgatcagaa gaacacctcg atctctgaga gtg 1553 

<210> 22 

<211> 1087

<212> DNA 

<213> Oryza sativa 

<220> 

<221> misc_feature

<223> PRO0177 - Cdc2-1 






<400> 22

cagacaccta gaatatagac attcccaaaa aataatcact atgcatcagc atcactatac 60
atgacttggg tctagtgatg gaagtggata gttccactac ctacataaaa acccactact 120
agtttattac ttttcacatg atagcataaa atttaaagaa aaaataaaca gaagtggaat 180
aagcgaaaaa ccccgcttac ccgccccatt tacatcccta cttggatcct gcatgtcagt 240
aagatatcag aattatatgt tttagaatta tatgtttttt tggaaggtgg aaatcggatt 300
attagacgca acataccaag tggcgtatac ttggcttcac tctttccatc agagcaagcg 360
taaaagatca cgtattcacg tcacatggag taactgagcg aatttttttc atutttaaat 420
ttttgttttt taatatttac ataaatatta taccggcgaa aatatttaca aaagtagacc 480
ctgctgccct tctccttctc gagaagagcg gcagggtgat gtcagggaca gaaataaact 540
ccaaaaatgc atttttggct gggcgaaaat tgcacttacc cccttgctgc cctctacaaa 600
ggttgcaagg gacctcagtg caaaatacgc acaccttgcc gtcctccact tggacggcat 660
gggctatttc tgtaaatatt ttggatggta taatatttct gtaaatatta aaaaataaaa 720
atttaaaaat gaaaaaattc tatctgggct cccttctctc atctcacacg gcccaccaca 780
caatcccggc ccacatattt cctgggccca tttccgtgtg aatggagacg gcccattggc 840
gcgcacatgc ggaaaagcgt acacacgatt cgaaatttga aatctcaaaa agcgcccgtt 900
agagcgcgtc ccctccaacg gctatcccca atacaaaaga tcactcgaat cccccccaaa 960
tcgaccaaac cctaaatcca cgcgcattcc acaccaccca accagcgaga gagagatggc 1020
ggcgctccac caccaggcgg cggcggcgcc ggtgacgacg acgacggacg ggggcgagct 1080
gcgggcg 1087



<210> 23 

<211> 1272 

<212> DNA 

<213> Oryza sativa 



<220>
<221> misc_feature
<223> TC89946 (PRO0110)

<220>
<221> misc_feature
<222> (17)..(17)
<223> n = any nucleotide

<220>
<221> misc_feature
<222> (50)..(50)
<223> n = any nucleotide



<400> 23 

tttgacgact gaatcgnggc tcgcctctgc ggcggccgct ctagattagn gtttcccctg 60

tctgttgtaa ttcggcacga gggctgatca agagctctta attagctagc tagtgattag 120

ctgcgcttgt gatcgatcga tctcgggtac gtagcaatgg cgtccaaggc gttcgctctg 180

ttcctggccg tgaacctcgt cgtgctcggg gtggcaagcg cctgcggcgg cagcccgtcg 240

tgcccgacgc cgacgccgtc gaccccgaca ccgtcaacgc cgacgccgac gccgtcggcg 300

ttcgggaggt gcccccgcga cgcgctgaag ctgggcgtgt gcgccaacgt gctgggcctg 360

atcaaggcca aggtgggcgt gcctccggcg gagccgtgct gcccgctgct ggaggggctc 420

gtcgacctcg aggcggcggt gtgcctctgc acggccatca ggggcaacat cc-cggaatc 480

aacctcaacc tccccatcga cctcagcctc atcctcaact actgcggcaa gaccgtcccc 540

accggcttca agtgctaagc agcgtgcata tgcaatgcct gcatgggttg atcctacgta 600

cggtgattag ttggctttga cgactcttga tttgatttgc ttgctgctct gtttatttgc 660

tactacgtta cgtacgtact ttgcatgcaa cgcaacgcat gatcgatcgt gcatgctggc 720

tgtttgtacg tatcacggta ccagtttgga ttctctctgt actctctcct ttgtcttctt 780

tgtagtactc ttattcccgc tatccgtacg tgcgcatttg ttgtaagggc cggtgctagc 840

ttgtgtgccg gtaccaactt ctaataaagc tatgggtgga acttcaaaaa aaataaaaaa 900

aaaactggag ggggggcccg ggtccaattt agactataat gagtttaaca ccccgctcat 960

cggccgaaga taacaacacc gggcttggaa aacctagact gcccaactaa tggacggaag 1020



acagactcrt ggactgaaac tgaacgaaac aagaccaccc accccatcia accacagcca 1080

cctaccgcca aagattccaa taatgtgaat cagtcggtaa tagaacactc ctcttgtacg 1140 

attttactgc ccgcgccacc cctcggtacg cacttatata tatcgggccg tagtaatttc 1200 

ctggttccgt cacttccctc atcgcacctg ctagtcgtgg cttacatacg tgcgtcctct 1260 

tattatcgag cg 1272

<210> 24
<211> 2425
<212> DNA
<213> Oryza sativa

<220>
<221> misc_feature
<223> TC90358 (PRO0005)

<220>
<221> misc_feature
<222> (1558)..(1558)
<223> n = any nucleotide

<400> 24

cccacattga ataattattt taaataattt aagttttttt tttttggctt tagatatatt 60

cccaatcccc aacctcccaa taatccgatc tctcccagtt ctgttcggat caaggctgtg 120

tcgatcgcaa aaaagaaaaa aaaaacaatt tccttttggg gtggttcatc tgttgatcac 180

ttctttgttt cccgcgtttt gttggggatt cgattttcgg gttaagattt tctacacgat 240

ggccttgaac ttggctcaga gcgccgcggc ggcagcgtgc ttcgcgaccg ccugtgatgc 300

gcggcgagct gcttcggtgg tcgccatgcc gtcgtcgtcg tcgtcggcca cgacgagcct 360

gaggatgaag aggcaggcgg cgtgcgagcc ggtggcgtgc cgggcggtgg ccaggcacgt 420

ggcggcggcg gcggcgagca gcaggaggaa cggcgtgccg gtgttcgtga tgatgccgct 480

ggacacggrg agcaagtgcg ggagcgcgct gaaccggagg aaggcggtgg cggcgagcct 540

gcggcgctg aagagcgccg gcgtggaggg gatcatggtg gacgtgtggt ggggcatcgt 600

ggagagcgag ggccccggcc ggtacaactt cgacggctac gtggagctca tggagatggc 660

ccgcaagacc ggcctcaagg tccaggccgt catgtccttc caccagtgcg gcggcaacgt 720

cggcgactcc gtcaacatcc cgctcccgag gtgggtggtg gaggagatgg agaaggacaa 780

cgacctcgcc tacaccgacc aatggggacg ccgcaacttc gagtacatct ccctcggctg 840

cgacgccatg cccgtcttca agggccgcac gcccgtcgag tgctacaccg acttcatgcg 900

cgccttccgc gaccacttcg cctccttcct cggcgacacc atcgtcgaaa tccaagtcgg 960

catgggcccc gccggcgagc ttcggtaccc gtcctacccg gagagcaacg gcacctggag 1020

gttccccggc atcggcgcct tccaatgcaa cgacaggtac atgcgtagca gcctgaaggc 1080

ggcggcggag gcgaggggca agccggtagt ggggccacgg cgggccgacg gacgccggcg 1140

gctacaacaa ctggccggaa gacacggtgt tcttccgcgg cgactgcggc gggtggagca 1200

ccgagtacgg cgagttcttc ctgtcgtggt attcgcagat gctgctggag cacggcgagc 1260

gcgtgctgtc gggcgcgacg tccgtgttcg gcgacggcgc cggcgccaag atctcggtca 1320

aggtggccgg catccactgg cacracggca cgcggtcgca cgcgccggag ctcacggcgg 1380

ggtactacaa cacgcggcac cgcgagcggc tacctcccga tcgcgcgcat gctggcgcgc 1440

cacggcgccg tgctcaac~t cacctgcgtg gagatgcgcg accacgagca gccgcaggag 1500

gcgcagtgca tgcccgaggc gctcgtcagg caggtggccg ccgcggcgcg cgcggcgnga 1560

cgtcgggctc gccggggaga acgcgctgcc gcggtacgac ggcacggcgc acgaccaggt 1620

ggtcgccgcc gccgccgacc gcgcggcgaa ggaccggatg gtcgccttca cctacctccg 1680

gatggggccc gacctcttcc acccggacaa ctggcgccgg ttcgtcgcct tcgtccgccg 1740

catgtccgag tccggctcgc cgcgggaggc cgccgagagc gccgcgcacg gcgtcgcgca 1800

ggccaccggc tcgctcgtgc acgaggccgc ggtcgcgctc cggagctagc accggtcaga 1860

cgctcatata caccgtcgcc tcgaggtcgg attccgatgt gggatcattc gatctccctt 1920

ttttttttct tctttttgcc attttgtaca gccttttggg gagctttgga tttgtgcttt 1980

ttgtctcggg aggaaaaccg ctctggaggt cgaagagagc gtcattttcc tcccgttgaa 2040

gatcacgaat catttacgtt agagatgatg taattaagca gggaggggag gggaacacac 2100

acacactggc actcaaaagt tgttgtcacg cttggggaat atatccatzt ccagccaaaa 2160

aaaaaacgca gaaatgcgzt gtgttcttgc gctctggttc gttgctgccg tgggtcagat 2220

tcagctggtg aaaaaactac agtactactg aaactgaaac tactagagcc tagagggaga 2280






